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PREFACE TO THE SECOND PRINTING 


The correction of a number of errors originally missed in proofreading and the 
addition of a new bibliography differentiate this printing from the first. 

Most of the errors were discovered while this book was being translated 
into Japanese by Professor Suzuki of Tokyo University, under the auspices of 
the Yamaichi Investment Trust Management Co., Ltd. I am greatly indebted 
to Professor Suzuki for these corrections, and to Tatsuo Majima, President, 
and Kazuo Kitamura, Director and Manager of the Research Department of 
Yamaichi, with whom I corresponded concerning the arrangements for, and the 
progress of, the translation. 

‘The new bibliography (pp. 308-15) was graciously supplied by Mark Rubin- 
stein, It was drawn from a larger bibliography of his which is to appear else- 
where.! Rubinstein’s annotated condensed bibliography is included here to 
provide the reader with an introduction to the extensive recent literature on 
portfolio selection. 

This literature includes works which apply the “Expected return, Variance 
of return (E, V) efficient set analysis” to areas beyond those covered in this 
book, and works which attack or defend the E, V efficient set approach. A 
brief characterization of what, to my mind, has been the impact of this litera 
ute on the contents of this book may be of interest to some readers. I will 
vol tty {0 argue my present positions here but will merely state them. Not all 
sontributions of substantial significance are even mentioned in this thumbnail 
sketch, My comments are organized according to the chapters of the book. 

 hapter 6, “Return in the Long Run,” serves to illustrate the method of 

eatic approximation which has practical und theoretical value in con- 
won with portfolio analysis, While writing the book at the Cowles Founda- 
‘wan in 1955-56, T concluded (as essentially did Latané? independently) that 
the investor who is currently reinvesting everything for “the long run” should 
maximize the expected value of the logarithm of wealth. Mossin? and 
on* have each shown that this conclusion is not true for a wide range 

ng utility to wealth at the end of the last time period, T. The 
ating Mossin-Samuelson result, combined with the straightforward argu- 


1 W Miace Ricks and Mark E. Rubinstein, eds., Portfolio Theory: Advanced Readings, 
tosh from McGraw-Hill Book Company, New York 
Henry A Latané, “Criteria for Choice among Risky Ventures,” Journal of Political 


my, April 1959) 


1 tan Mowwn, “Optimal Multiperiod Portfotio Policies,” Journal of Business, April 
ony 


Haul A Samuelson, “Risk and Uncertainty: A Fallacy of Large Numbers,” Scientia, 
Mer v6), 
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same around X» = E, In empirical tests, such as the illustrative experiment in 
Chapter 6, it repeatedly turns out that method (b) does better than (a). It now 
further turns out that method (a) is subject to the Pratt-Schlaifer and Borch 
objections, while method (b) is not. Thus—until some still better method pre- 
sents itself for estimating expected utility from the E and V of “realistic” dis- 
tributions—the use of the method of quadratic approximation, in theory and 
practice, should be confined to method (b) with Xy = E. 

My own chief theoretical worry remains, as in Chapier 13, with he gap 
between a theory based on perfect liquidity and the existence of illiquidities. 
In practice the gap is filled with plausible but ad hoc approximations. Keith 
Smith’s'® simulation analyses of portfolio adjustment procedures are im- 
Portant steps in the right directions, but this critical area remains mostly un- 
explored. 

As noted above, this brief sketch of my current general position does not at- 
tempt to mention every important recent contribution to portfolio theory and 
its application in practice. 
its applicatic Pract raiah 
Beverly Hills, California 
August, 1970 


13. Keith V. Smith, “Alternative Procedures for Revising Investment Portfolios,” 
Journal of Financial and Quantitative Analysis, December 1968. 


PREFACE 


This monograph presents techniques for the analysis of portfolios of securities. 
Although the techniques are mathematical in nature, the monograph is written 
primarily with the non-mathematician in mind. Part I discusses and illustrates 
the inputs, outputs, and objectives of a formal portfolio analysis. Part II pre- 
sents concepts and theorems needed subsequently in our exposition. Part III 
uses the prerequisites developed in Part II to go more deeply into techniques 
of portfolio analysis, Part [V, finally, discusses the theory of rational behavior 
and s applications to the selection of portfolios. 

‘The appendices of the book are for the mathematically trained reader only. 
‘Their main function is to prove certain more advanced relationships noted and 
used in the text. 

‘The mathematically trained reader may find the following suggestions help- 
ful: Part I should be read by way of motivation and illustration. Part 1I may 
be skimmed or skipped. It attempts an elementary exposition of the minimum 
requirements for the rest of the text. (Within Part II, Chapter 3 culminates in 
the formula for the expected value of a linear combination of variables, Chap- 
ter 4 culminates in the formula for the variance of a linear combination of 
random variables, Chapters 5 and 6 present certain laws of large numbers.) 

Part III presents a geometric analysis of, and computing procedures for, the 
derivation of efficient sets, Appendices A and B demonstrate that the com- 
puting procedures presented in Chapters 8 and 9 do, in fact, produce the 
desired efficient sets, Part IV, on the theory of rational behavior, presents and 
applies the utility and personal probability maxims. Appendix C continues the 
text's discussion of axiom systems for expected utility. 

I am indebted to several organizations for aid and encouragement. The 
method of analysis presented in this monograph was originally developed for 
my doctoral dissertation. This early work (1950-51) was supported by the 
Social Science Research Council and the Cowles Commission for Research in 
Economics. From August 1955 through May 1956, while on leave from the 
RAND Corporation, I did most of the writing required to transform thesis into 
monograph. During this period I was located at the Cowles Foundation for 
Research in Economics at Yale, where the writing of the monograph was made 
possible by a grant from the Merrill Foundation for Advancement of Financial 
Knowledge. To these organizations I wish to express my gratitude for in- 
tellectual and material support. 

I am also indebted to many individuals. James Tobin and Roy Radner 
read one or more drafts of many chapters and provided valuable advice with 
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respect to both content and exposition. Mrs. Markowitz read drafts of several 
chapters and provided helpful suggestions concerning exposition for the non- 
mathematician. 

The content of Part IV of this book reflects a series of conversations with 
Gerard Debreu. The content of Part I reflects sessions with Horace F. Isleib, 
Investment Officer of Yale University, and Ralph W. Halsey, Jr., Assistant 
Investment Officer. 

This monograph benefited from the diligence of a number of people: Ewing 
Jackson Webb, who prepared the inputs to the ten-security example; Harold 
Watts, Robert Z. Aliber, and Leroy §. Wehrle, who proofed the final draft 
for the Cowles Foundation; Mrs, Natalie Sirkin, who did the hard part of pre- 
paring the bibliography; and Miss Althea Strauss, who efficiently supervised 
the typing of two or three drafts of each chapter. 

While the afore-mentioned individuals and organizations have aided im- 
measurably in the writing of this book, all opinions and any errors contained 
herein are, of course, my own responsibility. 


Harry M. MARKOWITZ 
New York City 
February, 1959 


PARTI 


INTRODUCTION 
AND ILLUSTRATIONS 


Cuarrer I 


INTRODUCTION 


THE ANALYSIS OF PORTFOLIOS 


‘This monograph is concerned with the analysis of portfolios containing 
large numbers of securities. Throughout we speak of “portfolio selection” 
rather than “security selection.” A good portfolio is more than a long 
list of good stocks and bonds. It is a balanced whole, providing the 
investor with protections and opportunities with respect to a wide range of 
contingencies. The investor should build toward an integrated portfolio 
which best suits his needs. This monograph presents techniques of 
Portfolio Analysis directed toward determining a most suitable portfolio 
for the large private or institutional investor. 

A portfolio analysis starts with information concerning individual 
securities. It ends with conclusions concerning portfolios as a whole. 
The purpose of the analysis is to find portfolios which best meet the 
objectives of the investor. 

Various types of information concerning securities can be used as the 
raw material of a portfolio analysis. One source of information is the 
past performance of individual securities. A second source of information 
is the beliefs of one or more security analysts concerning future perform- 
ances. When past performances of securities are used as inputs, the 
outputs of the analysis are portfolios which performed particularly well in 
the past. When beliefs of security analysts are used as inputs, the outputs 
of the analysis are the implications of these beliefs for better and worse 
portfolios, 

This introductory chapter discusses broad principles upon which the 
techniques of portfolio analysis are based. The next chapter discusses the 
inputs, outputs, and objectives of illustrative portfolio analyses. Subse- 
quent parts of the monograph go more deeply into the techniques by which 
information concerning securities is transformed into conclusions con- 
cerning portfolios, 


4 PORTFOLIO SELECTION 
THE UNCERTAINTY OF SECURITY RETURNS 


Uncertainty is a salient feature of security investment. Economic 
forces are not understood well enough for predictions to be beyond doubt 
or error. Even if the consequences of economic conditions were under- 
stood perfectly, non-economic influences can change the course of general 
Prosperity, the level of the market, or the success of a particular security. 
The health of the President, changes in international tensions, increases or 
decreases in military spending, an extremely dry summer, the success of an 
invention, the miscalculation of a business management—all can affect the 
capital gains or dividends of one or many securities. 

We are expecting too much if we require the security analyst to predict 
with certainty whether a typical security will increase or decrease in value. 
Even if he could assemble all information, including information available 
only to the managers of the corporation and information available only to 
its competitors, the security analyst might still be forced to conclusions 
such as: 


This security may be expected to do well if securities in general do well. It 
must be expected to do poorly if securities in general do poorly. Even this 
following of the market is not certain. There are weaknesses which may cause 
it to do poorly even though securities in general are performing well: The 
possibility of a labor dispute or of an aggressive competitor cannot be ignored, 
‘On the other hand, there are potentialities which may bring success greater than 
even the corporation management dares hope. The new styling of the product, 
the (not inexpensive) advertising campaign, and the expansion of production 
facilities may prove to be a magic combination, fulfilling all expectations 
for it. 


Only the clairvoyant could hope to predict with certainty. Clairvoyant 
analysts have no need for the techniques of this monograph. 

‘The existence of uncertainty does not mean that careful security analyses 
are valueless. The security analyst may be expected to arrive at reasonable 
opinions to the effect that: 


The return (including capital gains and dividends) on security A is less 
uncertain than that on security B; the return on security C is more closely 
connected to the course of the general market than is that on security D; the 
growth of security E is more certain but has less potential than that of security 
F; only if the demand for their industry's product continues to expand (as it 
is likely, but not certain, to do) will the return on securities G and H be 
satisfactory. 


Carefully and expertly formed judgments concerning the potentialities and 
weaknesses of securities form the best basis upon which to analyze port- 
folios. 


INTRODUCTION 3 
CORRELATION AMONG SECURITY RETURNS 


A second salient feature of security investment is the correlation among 
security returns. Like most economic quantities, the returns on securities 
tend to move up and down together. This correlation is not perfect: 
individual securities and entire industries have at times moved against the 
general flow of prosperity. On the whole, however, economic good and 
ill tend to spread, causing periods of generally high or generally low 
economic activity. 

If security returns were not correlated, diversification could eliminate 
risk. It would be like flipping a large number of coins: we cannot predict 
with confidence the outcome ofa single flip; but if 2 great many coins are 
flipped we can be virtually sure that heads will appear on approximately 
one-half of them. Such canceling out of chance events provides stability 
to the disbursements of insurance companies. Correlations among 
security returns, however, prevent a similar canceling out of highs and 
lows within the security market. It is somewhat as if 100 coins, about to 
be flipped, agreed among themselves to fall, heads or tails, exactly as the 
first coin falls. In this case there is perfect correlation among outcomes. 
The average outcome of the 100 flips is no more certain than the outcome 
of asingle flip. If correlation among security returns were “perfect” —if 
returns on all securities moved up and down together in perfect unison— 
diversification could do nothing to eliminate risk. The fact that security 
returns are highly correlated, but not perfectly correlated, implies that 
diversification can reduce risk but not eliminate it. 

The correlation among returns is not the same for all securities. We 
generally expect the returns on a security to be more correlated with those 
in the same industry than those of unrelated industries. Business con- 
nections among corporations, the fact that they service the same area, a 
common dependence on military expenditures, building activity, or the 
weather can increase the tendency of particular returns to move up and 
down together, 

To reduce risk it is necessary to avoid a portfolio whose securities are all 
highly correlated with each other. One hundred securities whose returns 
tise and fall in near unison afford little more protection than the uncertain 
return of a single security. 


OBJECTIVES OF A PORTFOLIO ANALYSIS 


It is impossible to derive all possible conclusions concerning portfolios. 
A portfolio analysis must be based on criteria which serve as a guide to the 
important and unimportant, the relevant and irrelevant. 
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The proper choice of criteria depends on the nature of the investor. 
For some investors, taxes are a prime consideration; for others, such as 
non-profit corporations, they are irrelevant. Institutional considerations, 
legal restrictions, relationships between portfolio returns and the cost of 
living may be important to one investor and not to another. For each 
type of investor the details of the portfolio analysis must be suitably 
selected. 

Two objectives, however, are common to all investors for which the 
techniques of this monograph are designed: 


1. They want “return” to be high. The appropriate definition of 
“return” may vary from investor to investor. But, in whatever sense is 
appropriate, they prefer more of it to less of it. 

2. They want this return to be dependable, stable, not subject to un- 
certainty. No doubt there are security purchasers who prefer uncertainty, 
like bettors at a horse race who pay to take chances. The techniques in 
this monograph are not for such speculators. The techniques are for the 
investor who, other things being equal, prefers certainty to uncertainty. 


The portfolio with highest “likely return” is not necessarily the one with 
least “uncertainty of return."! The most reliable portfolio with an 
extremely high likely return may be subject to an unacceptably high degree 
of uncertainty, The portfolio with the least uncertainty may have an 
undesirably small “likely return." Between these extremes would lie 
portfolios with varying degrees of likely return and uncertainty. 

If portfolio A has both a higher likely return and a lower uncertainty of 
return than portfolio B and meets the other requirements of the investor, 
it is clearly better than portfolio B. Portfolio B may be eliminated from 
consideration, since it yields less return with greater uncertainty than does 
another available portfolio. We refer to portfolio B as “inefficient.” 
After eliminating all such inefficient portfolios—all such portfolios which 
are clearly inferior to other available portfolios—we are left with portfolios 
which we shall refer to as “efficient.” These consist of: the portfolio with 
less uncertainty than any other with a 6% likely return, the portfolio with 
less uncertainty than any other with a 7% likely return, and so on. It 
cannot be said of two efficient portfolios “the first is clearly better than the 
second since it has a larger likely return and less uncertainty.” All such 
cases have been eliminated. 

The proper choice among efficient portfolios depends on the willingness 
and ability of the investor to assume risk. If safety is of extreme impor- 
tance, “likely return” must be sacrificed to decrease uncertainty. If a 


1 In Ialer chapters we must give precise definitions (0 terms such as “likely” and 
“uncertainty.” For the present we may leave them as rough, intuitive concepts. 
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greater degree of uncertainty can be borne, a greater level of likely return 
can be obtained. An analysis of the type presented in this monograph: 


first, separates efficient from inefficient portfolios; 

second, portrays the combinations of likely return and uncertainty of 
return available from efficient portfolios; 

third, has the investor or investment manager carefully select the 
combination of likely return and uncertainty that best suits his needs; and 

fourth, determines the portfolio which provides this most suitable 
combination of risk and return, 


Chapter Il 


ILLUSTRATIVE PORTFOLIO ANALYSES 


INPUTS TO AN ILLUSTRATIVE PORTFOLIO ANALYSIS 


The nature and objectives of portfolio analyses may be illustrated by a 
small example concerned with portfolios made of one or more of nine 
common stocks and cash. The nine securities, listed in Figures 1a to li, 
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Figure la, Returns on security 1, American Tobacco, Common. 


include a utility, a railroad, a large and a small steel company, and several 
other manufacturing corporations. Cash is included in the analysis as a 
tenth “security.” No special significance should be attached to this list 
of securities other than that it will be used in illustrating principles of 
portfolio analysis. 

An actual portfolio analysis would start from a much longer list of 
promising securities. Not all these securities would appear in the final 
desirable portfolio. They enter the analysis as candidates for a place in 
the desirable portfolio. 
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Figure 1b. Returns on security 2, American Tel. & Tel., Common. 
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Figure le, Retums on security 3, United States Steel, Common. 
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Figure Ie, Returns on security 5, Atchison, Topeka & Santa Fe. 
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Figure 1g. Returns on security 7, Borden, Common. 
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Returns on security 9, Sharon Steel, Common. 


Figure ti, 


The return during a year is defined 


‘The returns on the nine securities, during the years 1937-54, are presented 


in Table 1 and illustrated in Figure !. 


to be 


(the closing price for the year) minus 


(the closing price for the previous year) plus 
(the dividends for the year) all divided by 
(the closing price of the previous year). 
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For example, the return in 1948 is 


(closing price, 1948) — (closing price, 1947) + (dividends, 1948) 
(closing price, 1947) 


This is the amount which an investor would have made or lost if he invested 
$1.00 at the end of 1947, collected the dividends declared in 1948, and sold 
at the closing price of 1948. A loss is represented by a negative return. 
For example, if the closing price of 1947 were 50, that of 1948 were 45, and 
$2 of dividends were declared during 1948, then the return in 1948 would be 


45—50+2 
50 


or a loss of 6% per dollar invested. 

Our example portfolio analysis will consider performances of portfolios 
with respect to “return” thus defined. This assumes that a dollar of 
realized or unrealized capital gains is exactly equivalent to a dollar of 
dividends, no better and no worse. This assumption is appropriate for 
certain investors, for example, some types of tax-free institutions, Other 
ways of handling capital gains and dividends, which are appropriate for 
other investors, are discussed later. 

Our nine securities differed in the amount of return which they yielded 
on the average. For example, the average of the annual returns on 
United States Steel Common Stock was 14.6 cents per dollar invested; 
that on Coca-Cola Common was 5.5 cents per dollar invested. On the 
average! the return on U.S. Steel was higher than that on Coca-Cola, 

Securities also differ with respect to their stability of return. For 
example, the greatest loss incurred on A. T. & T. was 18 cents per dollar 
invested (in 1941). On the other hand, the greatest loss on Sharon Steel 
was 43 cents per dollar invested (in 1937). In three other years Sharon 
Steel showed losses exceeding 20 cents per dollar. Clearly, A. T. & T. 
showed less variability of return than did Sharon Steel. 

Portfolio selection should be based on reasonable beliefs about future 
rather than past performances per se. Choice based on past performances 
alone assumes, in effect, that average returns of the past are good estimates 
of the “likely” return in the future; and variability of return in the past is 
a good measure of the uncertainty of return in the future. Later we shall 
see how considerations other than past performances can be introduced 
into a portfolio analysis. For the present it is convenient to discuss an 
analysis based on past performances alone. 


= -.06, 


2 There are various ways of averaging set of numbers. We shall use the “ordinary” 
average, obtained in this case by adding together the eighteen numbers and dividing by 
eighteen, 


ILLUSTRATIVE PORTFOLIO ANALYSES 15 


Suppose that a portfolio consisted of 20 cents’ worth of Atchison, 
Topeka & Santa Fe per dollar invested, plus 80 cents’ worth of Coca-Cola 
per dollar invested. The return in 1954 on such a portfolio would be 


(.2) times (the return of A. T. & Sfe in 1954) plus 
(.8) times (the return of Coca-Cola in 1954) 


= (.2-469) + (.8(.077) 
= 155. 


Return can be calculated similarly for any combination of securities in 
any year. 

The average return on the portfolio consisting of 80% Coca-Cola and 
20% A. T. & Sfe was equal to 


(.8) times (the average return on Coca-Cola) plus 
(.2) times (the average return on A. T. & Sfe) 


= (84.055) + (.2)(.198) 
= .084. 


This is higher than the average return on Coca-Cola and lower than the 
average return on A. T. & Sfe. Inevitably the average return on a port- 
folio lies somewhere between the highest and the lowest average return on 
the securities contained in the portfolio. 

One might conjecture that the variability of return on a portfolio can, 
similarly, be no smaller than that of the least variable security in the 
portfoiio. But this is not so. The return on A. T. & Sfe was rather 
unstable during the period 1937-54 (showing a maximum loss of 45 cents 
on the dollar), The return on Coca-Cola was more stable, showing a 
Maximum loss of only 25 cents. The return on the 80%-20 % combination 
of Coca-Cola and A. T. & Sfe, respectively, was still more stable, Its 
maximum loss was only 18 cents on the dollar, In Figure 2 we have 
plotted the annual returns on the portfolio consisting of 80 cents Coca-Cola, 
20 cents A. T. & Sfe. For comparison we have also plotted the return on 
Coca-Cola. 

“Largest loss” is not the only possible measure of variability, Another 
Measure, better for our purposes, is discussed later. In terms of this 
measure also, the variability of A. T. & Sfe is greater than that of Coca- 
Cola, while that of Coca-Cola is, nevertheless, greater than that of the 
Portfolio. For the present we assume that Figure 2 and the reader's eye 
Sonfirm the statement that the variability of the particular portfolio was 
Jess than that of either of the securities contained in it. 

Our 20%,-80% portfolio had both a higher average return and a lower 
variability of return than a portfolio consisting of 100% Coca-Cola. On 
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Figure 2. Returns on Coca-Cola Co. Common Stock and on a mixture of two 
securities (solid curve, Coca-Cola; dash curve, 80% C.C. + 20% A. T. & Sfe). 
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(b) 


Figure 3, Two hypothetical series. 
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the whole, the “diversified” portfolio was both more profitable and more 
stable than Coca-Cola alone. One might wonder whether or not there was 
some other portfolio—some other combination of our ten securities (nine 
securities and cash}—which had both greater average return and greater 
stability than even the 20%-80% mixture. Or perhaps there was a 
portfolio with greater average return and the same stability; or greater 
stability and the same average return. 

Before we can discuss such questions, we must settle on some particular 
measure of the variability of return on a portfolio. “Greatest loss” is a 
possible measure, but not a good one for our purposes. For example, 
fails to distinguish between a security whose pattern of returns is shown in 
Figure 3a from one whose pattern of return is shown in 3b. In Chapter 
VIII various measures of variability are evaluated in terms of basic 
principles of behavior under uncertainty. The discussion there confirms 
that “maximum loss” is not a desirable measure for us. 

A better measure is the standard deviation, frequently used in statistics 
And statistical applications in such diverse fields as economics, psychology, 
and astronomy. The next section describes the standard deviation. 
‘The reader may skim or skip the details of this section since subsequent 
sections require only the knowledge that the standard deviation is a 
measure of variability. 


THE STANDARD DEVIATION 


The definition of the average of eighteen numbers is, in effect, a set of 
Computing instructions. It says “add together the eighteen numbers and 
divide by eighteen.” The definition of a standard deviation is also a set of 
Computing instructions, albeit a more complicated set. 

We begin with a series of numbers such as the returns on security 1 
{column 2, Table 2). From each number we subtract the average: 


—.305 — 066 = —.371, 
513 — .066 = 447, 
055 — .066 = —.011, etc. 


' lis frequent use in other fields does not prove that the standard deviation is a good 
Meniure for evaluating portfolios, In fact, the reasons for its use in statistics differ 
from those which justify its use in the evaluation of portfolios. Its use in statistics is 
Mequently due to its connection with a particular “bell-shaped” or “normal” curve 
Which describes the probabilities associated with a variety of chance events, Its 
Jwsiilleation in the evaluation of portfolios is connected with the fact that, for conserva~ 

investors, a loss of 21 dollars is more than twice as bad as a loss of L dollars; while 
* mrin of 2G dollars is not quite twice as good as a gain of G dollars, 
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Taste 2 
COMPUTATION OF STANDARD DEVIATION 


Deviations Squared 

Year | Returns from | pevistions 

Average 

1 — 305 -3n 137641 
2 513 447 .199809 
3 055 —O1 000121 
4 126 —192 | 036864 
5 — 280 — 346 | 119716 
6 —003 — 069 004761 
7 428 362 131044 
8 192 126 | 015876 
9 446 380 | 144400 
10 —.088 —154 | 023716 
n =.193 037249 
2 — 081 006561 
13 239 057121 
4 = 162 026244 
15 —050 | 002500 
16 128 062 003844 
7 —010 —076 | 005776 
18 154 088 007744 
Average| 066 000 | 053333 


Standard Deviation = V.0533 = 231. 


We now have eighteen new numbers known as “deviations from the aver- 
age” (column 3, Table 2). The next step is to square these deviations; 
(-371)? = (=.371(—.371) 
= 137641, 
(447)? = 199809, 
(—.011)% = .000121, etc., 


giving us eighteen squared deviations from the average (column 4 in 
Table 2). 


—_ 
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Next we find the average of the squared deviations: 


«137641 + .199809 + + + + + ,007744 


73 = 053333. 


‘This average squared deviation is called the variance of the numbers. The 
standard deviation, finally, is the square root of the variance. Thus the 
Standard deviation of the returns on security 1 is 


V.0533 = .231, 
In short, the standard deviation is 
the square root of the average squared deviation. 


‘The standard deviation of return on a portfolio is not determined solely 
by the standard deviations of its individual securities. It also depends on 
the correlations between securities. The “correlation coefficient” measures 
the extent to which two series of numbers tend to move up and down 
together. If they move up and down in perfect unison, the correlation 
Coefficient is 1. If the rise (or fall) of one makes it no more or no less 
likely that the other will rise (or fall), then their correlation coefficient is 
zero: they are uncorrelated. The more the two series of numbers tend to 
move up and down together, the greater is their correlation coefficient. 
(The exact definition of the correlation coefficient is presented in a 
subsequent chapter.) 

The standard deviation of a portfolio is determined by 


(a) the standard deviation of each security, 


(b) the correlation between each pair of securities, and, of course, 
(c) the amount invested in each security. 


Once (a), (b), and (c) are known, the standard deviation of the portfolio 
fan be computed, Other things being equal, the higher the correlations 
among security returns, the greater is the standard deviation of the port- 
folio as a whole. To put it another way: the more the returns on indi- 
vidual securities tend to move up and down together, the less do variations 
in individual securities “cancel out” each other; hence the greater is the 
variability of return on the portfolio. 


OUTPUTS OF THE ANALYSIS 


Figure 4 shows the average return and standard deviation of return on 
(he securities in Table 1, The horizontal axis represents the average 
felurn; the vertical axis represents the standard deviation. Thus the 
Point labelled 1 indicates that security | (American Tobacco Common 
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Stock) had an average return of .066 (6.6 cents per dollar) and a standard 
deviation of return of .23, The point P represents the average and the 
standard deviation of return on the portfolio with 20 cents of A. T. & Sfe 
and 80 cents Coca-Cola per dollar invested. Our tenth security, cash, has 
a zero average and a zero standard deviation of return. 
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Figure 4. Some obtainable combinations—average and standard deviation. 


We see from Figure 4 that A. T. & T. (security 2) had about the same 
return on the average and a much lower standard deviation than either 
security | or 6. Clearly, security 2 performed better during the period 
than did 1 or 6, combining as high an average with greater stability, 

Portfolio P, as noted before, had both a higher average and a lower 
standard deviation than security 6. Security 7 had a still higher average 
and lower standard deviation than portfolio P. 

Is there a portfolio which had the same average return as security 2 but 
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had a smaller standard deviation? Is there a portfolio which had the same 
standard deviation as security 2 but had a higher average return? Is there 
4 portfolio which had the same average return as security 7 but a smaller 
standard deviation? Is there a portfolio which had both a higher average 
and a lower standard deviation than did security 47 

The answer to all these questions is “yes.” The curve in Figure 4 
indicates the smallest standard deviation obtainable with eath level of 
average return. It indicates, for example, that there was a portfolio with 


an average return of 5 cents (,05 dollar) per dollar invested and 
a standard deviation of slightly Jess than 6 cents per dollar invested. 


No portfolio with an average return of $ cents or more had a lower 
standard deviation, Similarly there was a portfolio with 


an average return of 10 cents per dollar invested, and 
a standard deviation of slightly more than 11 cents per dollar invested. 


No portfolio with this much average return had a lower standard deviation. 
‘The curve in Figure 4 was derived from 


(a) the average returns of the individual securities, 
(b) the standard deviations of the individual securities, and 
(c) the correlations between each pair of securities. 


‘The procedures by which such a curve is obtained are discussed in 
Chapter VII? 

Comparing the curve with the numbered points, we see that there was a 
portfolio with the same average return as A. T. & T. (security 2) but with 
litle more than 1/2 the standard deviation. There was a portfolio with 
the same standard deviation but with about 66% more return on the 
average than security 2, There were portfolios with both slightly more 
Average return and slightly less standard deviation than security 7. There 
Was 8 portfolio with the same average return as security 4 but with much 
less standard deviation; and one with the same standard deviation as 
‘ourity 4 but with slightly more average return. 

We can divide portfolios into two groups: 


(1) those whose average return and standard deviation of return are 
fepresented by a point on the curve in Figure 4, and 


' A curve such as that in Figure 4 is drawn on the basis of some assumption about 

Wlimate” portfolios. The analysis behind the curve of Figure 4 did not permit 
owing (€.g-, buying on margin) or short selling. The portfolio was otherwise 
Wieriricted, Different assumptions about “legitimate” portfolios are appropriate for 
Wifferent investors, 
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(2) those whose average return and standard deviation of return would 
be represented by a point above the curve. 


No portfolio has an average return and standard deviation which would be 
represented by a point below the curve.’ 

A portfolio of the second sort, not on the curve, is called inefficient. 
If a portfolio is inefficient, there is either some other portfolio with more 
average return and no more standard deviation, or else some portfolio 
with less standard deviation and no less average return. In the case of 
most inefficient portfolios there are portfolios which have both more 
average return and less standard deviation. 

Thus the portfolio consisting entirely of security 6 is inefficient because 
portfolio P has more average and less standard deviation; portfolio P is 
inefficient because portfolio consisting entirely of security 7 has more 
average and less standard deviation; while the portfolio consisting entirely 
of security 7 is inefficient because there is a portfolio represented by some 
point on the curve (e.g., the point with an average of .14 and a standard 
deviation of about .16) which has still more average return and still less 
standard deviation. 

If a portfolio is represented by a point on the curve, it is called efficient. 
If 4 portfolio is “efficient,” it is impossible to obtain a greater average 
return without incurring greater standard deviation; it is impossible to 
‘obtain smaller standard deviation without giving up return on the average. 
For example, the efficient portfolio with an average return of .1 has a 
standard deviation of return of slightly more than .11. If we wanted an 
average return of .14 we should have to accept a standard deviation of 
about .16; if we wanted a standard deviation of .08 we should have to 
accept an average return of .07. ; 

Suppose we believed that past averages and standard deviations were 
reasonable indicators of “most likely” return and “uncertainty” of return 
in the fulure. Figure 4 would indicate combinations of “most likely” 
return and “uncertainty” of return obtainable from portfolios. We would 
not want an inefficient portfolio, because we could obtain greater return 
with greater certainty by choosing an efficient one, Our “chosen’? 
portfolio would be an efficient portfolio. : 

Judgment must be employed in choosing one of the set of all efficient 
portfolios. The “investor” must contemplate the various efficient com- 
binations of average return and standard deviation. He must decide 
whether it is better for him to select a portfolio with, for example, 


an average return of .04 and a standard deviation of .045, or one with 


That is, no “legitimate” portfolio (in the sense of the footnote on page 21) has an 
average return and standard deviation represented by a point below the curve, 
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an average return of .10 and a standard deviation of .113, or one with 
an ayerage return of .14 and a standard deviation of .16. 


The “inyestor” must choose one combination of average and standard 
deviation which, more than any other, satisfies his needs and preferences 
with respect to risk and return. 

Once the “investor” chooses among these efficient combinations of 
average return and standard deviation, the analysis can indicate a portfolio 
which gives rise to the chosen combination. If he decides on an average 
feturn of .07 and a standard deviation of .08, the portfolio analysis indicates 
that the corresponding portfolio has 


8 cents, security 3 
1.5 cents, security 4 
6 cents, security 5} per dollar of portfolio. 
44.5 cents, security 7 
50 cents, security 10, 


If the “investor” is curious about the efficient Portfolio with an average 
feturn of .175, the analysis indicates that this has 


33. cents, security 4 
45.5 cents, security 5 per dollar of portfolio. 
21.5 cents, security 7. 


Among the efficient portfolios are special ones referred to as corner 
portfolios. Once we know the corner portfolios, we can easily derive the 
Portfolio associated with any efficient combination of average return and 
Mandard deviation. 

The nature and significance of the corner portfolios can be illustrated 
geometrically by an example involving three securities. The curve in 
Figure 4 relates standard deviation to average return. It does not portray 
the amounts invested in each security. Such a portrayal is difficult when 
ten securities are involved. It is a simple matter for three securities. In 
Vigure 5 the horizontal axis represents the amount invested in a first 
feourity. The vertical axis represents the amount invested in a second 
‘ourity. Thus the point P represents a security with 25% invested in 
eourity 1 and $0% invested in security 2. The amount invested in security 
) must be 25%, since the amounts invested in the three securities must add 
10 100%. 

The heavy line shows how the set of efficient portfolios can look in a 
three-security analysis. In the present example every portfolio represented 
bya point on the heavy lineis efficient, Any portfolio not thus represented 
#} Not efficient. The locus of points portraying efficient Portfolios starts at 
ihe point a, whose portfolio has smallest variance. The locus moves in a 
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straight line from a until it reaches the point b; there it turns and moves in 
another straight line until it reaches c; there it turns again and moves in a 
straight line until it reaches the point d, whose portfolio has largest average 
return, If the points a, 5, c, and dare known, the other points Tepresenting 
efficient portfolios can be inferred. The points a, b, c, and d represent 
the corner portfolios of the present three-security example. 


8 


& invested 1n security 2 


% invested in security 1 


Figure 5, Efficient portfolios among three securities. 


Our ten-security analysis has 7 corner portfolios, as listed in Table 3. 
For example, the third corner portfolio consists of 


8 cents, security 4° 
92 cents, security 5. 
and had an average return of .196. 

Each corner portfolio is efficient: its average and standard deviation is 
represented by a point on the curve in Figure 4, Any “weighted average” 
of consecutive corner portfolios is also efficient. For example, let us take 
a weighted average of the fourth and fifth corner portfolios, using the 
weights 1/4 and 3/4. We get a new portfolio with 

% invested in security 4 = (1/4)(41) + (3/4)(25) 
= 29, 
%, invested in security 5 = (1/4)(59) + (3/4\(32) 
= 39, 
% invested in security 7 = (1/4)(0) + (3/4)(43) 
= 32, 
average return = (1/4)(.188) + (3)4)(.162) 
= 169. 


} per dollar of portfolio 


Taste 3 
CORNER PORTFOLIOS 
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Securities 1, 2, 3, 6, 8, 9, 10 do not appear in either the fourth or the fifth 
corner portfolio and therefore do not appear in the weighted average of the 
two. The new portfolio, calculated above, is efficient, 

If we take a weighted average of the second and fourth corner portfolios, 
we do not get an efficient portfolio. We must ust consecutive portfolios 
such as the first and second, or second and third, etc. The two weights 
used (like 3/4 and 1/4) must be between 0 and 1 and must add up to |. 

The efficient portfolio with an average return of .19 is a weighted average 
of the third and fourth corner portfolios. This is indicated by the fact 
that the third corner portfolio has a higher, and the fourth corner portfolio 
has a lower average return than .19. Similarly, the efficient portfolio with 
an average return of .15 isa weighted average of the fifth and sixth corner 
portfolios; the efficient portfolio with an average return of .08 is a 
weighted average of the sixth and seventh corner portfolios. 

Suppose we wished to find the efficient portfolio with an average return 
of 15. This lies between the fifth and sixth corner portfolios, which have 
average returns of .162 and .140 respectively. We must find weights, 
wand 1 — Ww, so that 


(162) + (1 = w(.140) = 150. 


Solving for w, we get 
150 — 4140 
162 — .140 


l-wer 
Thus the efficient portfolio with an average return of .150 has 

invested in security 3 = (5/11)(0) + (6/11)(16) = 9, 

invested in security 4 = (5/11)(25) + (6/11)3) = 13, 

invested in security 5 = (5/11)(32) + (6/1112) = 21, 

invested in security 7 = (5/1143) + (6/1169) = 57. 
The efficient portfolio associated with any combination of average return 


and standard deviation of return on the curve in Figure 4 can be found 
similarly. 


Propaaitity Beviers AND PoRTFOLIOS 


The inputs to the analysis discussed above were past performances of 
individual securities; the outputs were statements about performances of 
portfolios. Portfolio selection based solely on such an analysis assumes, 
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in effect, that past averages and standard deviations are reasonable 
measures of the “likely” return and the uncertainty of return in the future. 
‘The present section considers a second type of input to a portfolio 
analysis. A third type is touched on in the following section. 
Rather than using past performances per se, we could use the “proba- 
bility beliefs” of experts as inputs to a portfolio analysis. This raises 
three questions: 


What is a probability belief? 
What is an expert? 
How do we get the former from the latter? 


The nature of probability belief can be illustrated by 2 very hypothetical 
txample. For this example we need two “props.” The first “prop” is a 
large “wheel of fortune” marked with numbers | through 100. The 
wheel is perfectly balanced and impeccably honest. Thus there is exactly 
4.01 probability (1 chance in 100) that the number 1 will be the result of a 
spin of the wheel. Similarly, there is exactly a .02 probability (2 chances 
in 100) that either the number | or else the number 2 will appear; and a 

16 probability that one of the first sixteen numbers will appear. 

The second “prop” is a rich but eccentric uncle of yours who has willed 
you a chance to win a large fortune. You even have a choice as to the 
hind of chance situation in which to engage. Specifically you must choose 
between the following two alternatives: 


Alternative 1. The wheel marked with the numbers 1 to 100 will be 
spun. Ifany number from | to 80 appears, you win the fortune. Other- 
wise you are thanked for your cooperation and the fortune goes to the care 
Wf ged cats. Alternative 1 is subject to a .2 probability of losing the 
fortune. 

Alternative 2. You win the money if it does not rain tamorrow. Ifa 
{ince of rain is reported by the local weather station, the money goes to 
‘We cats. 


If you prefer alternative 2 to alternative 1, then the probability belief 
you attach to rain tomorrow is less than .2. If you prefer alternative 1, 
‘en the probability belief you attach to rain is greater than .2. If 
jou are indifferent between the two alternatives, your probability belief 
eval 2 

The choice of alternative 1 or 2 would depend on considerations such as 


' We may suppose that the alternatives are set up so as to minimize extraneous 
01 such as differences in “suspense” and possible “regret.” For example, which- 
#41 sllernative is chosen, the wheel is spun and the outcome of each alternative is 
Winwunced, Other details are left to the reader's imagination, 
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cloud formation, temperature, humidity, the weather forecast if you 
happened to see it or were allowed to look at it, experience in the past 
(especially on days similar to the present day). If possible, you might 
consult the records of the weather bureau. Or perhaps you could consult 
a meteorologist who could better judge the possibilities latent in the 
current weather picture. 

Usually the meteorologist is expected to make a prediction: to tell 
whether rain or no rain is more likely. In the present example the 
meteorologist is asked to advise on the probability belief which should be 
atlached to “rain tomorrow.” We need not restrict ourselves to mythical 
situations involving eccentric uncles for questions of probability belief to 
be relevant. A higher probability of “right” weather conditions is 
required before proceeding with a nuclear weapons test than with a picnic 
excursion, while a higher probability is required for the picnic excursion 
than for hanging the family wash. 

The Security Analyst is the meteorologist of stocks and bonds. If he 
is thorough, his statements about the future of a security will be based on 
general conditions and prospects for the economy and the market; the 
nature of possible new developments in the industry; the past performance, 
financial structure, and other matters relating to the opportunities and 
dangers confronting the corporation; and, finally, the position of the 
particular security vis-a-vis others of the corporation 

We shall not discuss the procedures of the security analyst in arriving 
at reasonable beliefs about securities. Works on security analysis are 
available? The topic of this monograph is Portfolio Analysis. A 
portfolio analysis begins where security analyses leave off. 

The relationship between portfolio analysis and security analysis may 
be illustrated by a particular portfolio analysis based on the probability 
beliefs of security analysts, These beliefs were recorded on forms such as 
those in Figures 6 and 7. The information recorded was the output of 
security analyses and the input to a portfolio analysis. The information 
required for the portfolio analysis depended on 


(1) the objectives of the investor and 

(2) the need of the portfolio analysis for estimates of the most likely 
return on each security, the uncertainty of return associated with each 
security, and the correlation between each pair of securities, 


The investor, in the example under discussion, was a tax-exempt institution. 
Long-standing higher policy restrained the investor from using capital 
gains for current expenditures. This policy was accepted in the portfolio 


* See, for example, Graham and Dodd, Security Analysis (2) 
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Tee of Saini TGurrent Vie) 


RELATION OF SECURITY'S RETURN TO 
STANDARD & POOR'S INDEX 


If the Standard & Poor 
Index changes during 
the next three years 


byreces 
+90% +55 +20% -15%  —50% 
High | 
Then we may 
expect of the Likely 
security ee 
Correlations 


List securities which are other than “normally” correlated with the above 
security. (See attached sheet! for definitions of “normally correlated,” HH, H, 
L, and LL.) 


HH 


H 


N (It is unnecessary to list securities in this category. All securities not 


listed elsewhere will be assumed to be here.) 


L 


LL 


Remarks: 


+ Discussed in text, 
Figure 7, Form 2. 
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Analysis as an immutable fact of life. It was decided that a desirable 
portfolio 


(i) would yield at least a specified current income,’ to provide for 
current expenses, and 

(ii) would grow in value (from additional income or capital gains) 
during the next three years, to provide the basis of future income and 
growth. 


Ii was decided that “current income” was sufficiently predictable that 
Uncertainty in this area could be ignored. Uncertainty of future growth, 
however, could not be ignored. 

The portfolio analysis produced a set of “efficient portfolios” similar to 
those discussed in the last section. Each efficient portfolio gives minimum 
“uncertainty of growth” for given “most likely growth.” Each efficient 
portfolio also meets the requirement for current yield. 

Two forms were used to record beliefs about securities. The first form, 
Vigure 6, is concerned with the performance of the Standard & Poor 
Index of 480 Common Stocks during the next three years (May 1956— 
May 1959). It asks that “probabilities” be assigned the statements: 


The Index will rise by at least 75% during the three years, 
The Index will rise by at least 40% but less than 75%, 
The Index will rise by less than 40% 

The Index will fall by less than 25%. 

The Index will fall by more than 259%, 


The actual performances during all such three-year periods from 1871 to 
1955 were presented. Thus the arrow labeled 1929 indicates that there 
Jens over a 100% increase in the Index from May 1925 to May 1929. The 
Soluma labeled “frequency” (of occurrence) indicates, for example, that 
§ three-year gain of more than 75% occurred $ times during the period. 
The column labeled “percentage” (of occurrence) indicates that this 
frequency of 5 represents 6% of the total 82 observations. 

The information on past performances was presented for the convenience 
| the security analysts, to be considered or ignored as they saw fit. The 
fobability beliefs recorded in the last column represented a concensus 
/yneerning the (then) future performance of the market. 

One copy of the second form, Figure 7, was filled out for each security 
in the analysis. The form provides space for 


(1) name of security, 
{2) estimated current income, 


! Interest and di 


nds during the coming year. 


32 PORTFOLIO SELECTION 


(3) answers to questions such as “Suppose the Standard & Poor Index 
rises by 20%, what do you consider to be the most likely return on this 
security? What do you consider to be a “high” return? A low return? 
Similar questions were asked for other changes in the Index. 

(4) Indication that the security in question is “unusually” correlated 
with particular other securities in the analysis. 


A “high” return is defined to be a return so high that the analyst feels 
there is but one chance in twenty that a higher return would occur. A 
“Jow" return is defined to be a return so low that the analyst feels there is 
but one chance in twenty that a lower return will occur. Thus there is one 
chance in ten that the return will be either higher than “the high” or lower 
than “the low.” High and low need not be defined in terms of “one 
chance in twenty.” A different level of probability could be used, as long 
as it is used consistently. 

The returns on most securities are correlated. If the Standard & Poor 
Index rose substantially, we would expect United States Stee! (Common) 
to rise. If the Standard & Poor Index rose substantially, we would also 
expect Sweets Company of America (Common) to rise. For this reason 
itis more likely that United States Steel will do well when Sweets Company 
does well. If the returns on two securities are correlated only because 
they are correlated with things in general (and the Standard & Poor Index 
in particular), then we say that they are normally correlated. The returns 
on two securities may be more highly correlated than would be indicated 
by their relationships with the Standard & Poor Index. Ifa company in 
the automobile industry does better than one would expect from the rise in 
the Index, there is a greater chance that a company in the tire business will 
do better than one would expect from the rise in the Index, The returns 
from securities of such companies are referred to as highly or very highly 
correlated, depending on the degree of correlation. It is also possible for 
securities to be less correlated than would be indicated by their relation- 
ships with the Index. This would be the case for two firms in the same 
industry, if the total demand for the output of the industry were stable 
but sales of individual firms were subject to severe competition. The 
securities of such firms would be referred to as fess than normally or much 
less than normally correlated, depending on the degree to which one 
security would tend to move opposite to the other if the Index stayed 
constant. 

Space is provided on the form to indicate securities in the analysis with 
which the particular security is very highly correlated (HH), highly cor- 
related (H), less than normally correlated (L), or much less than normally 
correlated (LL). Normal correlations (IN) need not be listed. 
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Such forms, anda small amount of written material accompanying them, 
‘must be supplemented by verbal communication between portfolio analyst 
and security analyst. In particular, special effort must be made to ensure 
4 meeting of minds on notions such as “one chance in twenty” and “norm- 
ally correlated.”” 

In Figures 1a through li we presented information concerning past 
performances of individual securities. The portfolio analysis based on 
this information derived conclusions concerning past performances of 
portfolios. Many of these conclusions were not self-evident before the 
analysis, even though the past performances of individual securities were 
systematically arrayed for inspection. In a similar manner, a security 
analyst may have beliefs about individual securities without seeing their 
implications for portfolios as a whole. The purpose of the portfolio 
Analysis is to bring out these implications. 

Ry analogy, a poker player may believe that a deck of cards is well 
shuffled. Yet he may not know all the implications of this belief. He is 
Hot likely to know, offhand, the probability of beating 3 aces and 2 jacks; 
or of beating 4 cights anda king if deuces are wild. It is usually not polite 
OF convenient to employ a computing machine to calculate probabilities 
during the course of a poker game. In portfolio selection, however, the 
tlukes are higher and decisions should be made on the basis of thorough 
Analysis. 


A Vanity oF PorTFoLio ANALYSES 


The illustrative portfolio analyses discussed in previous sections are but 
{wo of a variety of portfolio analyses which could be performed. No 
Hingle type of analysis is right for all purposes. The choice of analysis 
lepends on the nature and goals of the investor. The present section 
iseusses some of the ways in which portfolio analyses can vary. 

Alternative Inputs. Methods have been suggested for predicting future 
‘eurity performances from past performances, volume of transactions, or 
oiler measurable phenomena. I shall not discuss the pros and cons of 
#iy such system, except to note that no method of prediction is infallible, 
With few exceptions, predictions are subject to some range of uncertainty. 

‘0 investor might feel that a particular method of prediction has merit. 
He might wish to select a portfolio which uses its predictions but, through 
foper diversification, reduces some of the uncertainty attached to predic- 
Wiens for individual securities. The predictions, and estimates of the 
{noertainty of predictions, can be used as inputs to a portfolio analysis. 
Th outputs of the analysis are, then, implications of the method of 
Wediction for better and worse portfolios. 
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Another approach is to consider such predictions, along with other data, 
to arrive at the probability beliefs discussed in the preceding section. 

Taxes. Some investors are not subject to Federal Income tax. For 
others this is a prime consideration in the choice of portfolio. A portfolio 
analysis can take into account 


(i) the differences in taxes on income and capital gains, 
(ii) the change in tax rate with change in income level, and 
(iii) the fact that capital gains taxes are paid only if the gains are 
“realized” (if the security is sold). 


Exogenous Assets. Many investors have sources of income other than 
their portfolio. For a private investor this might include a salary; for a 
university it would include tuition and endowments. These other sources 
of income are not irrelevant to the portfolio selection decision. It is 
desirable to choose a portfolio whose performance will not be too highly 
correlated with the investor's other income. It is undesirable to have a 
portfolio perform worst when it is needed most. Such considerations may 
be handled formally by including other sources of income as a fictitious 
security or “exogenous asset." The holding of this asset is not subject to 
choice in the analysis, but the correlation of its income with the yields of 
securities will influence the selection of portfolio. 

Cost of Living. Also of importance in many cases is the rise and fall in 
the costs of fulfilling the needs and obligations of the investor. To account 
for this the analysis may be performed in terms of “real return” rather than 
money returns. 

If the ten-security analyses discussed earlier had been based on real 
return rather than on money return, cash would have had a negative 
average return and a positive standard deviation. 


SUMMARY 


The examples of the present chapter have illustrated the inputs, the 
outputs, and the objectives of portfolio analyses. Parts II and III of this 
monograph discuss relationships and techniques by which information 
about securities are transformed into conclusions about portfolios. Part 
1V discusses topics related to the choice of criteria for a portfolio analysis. 


PART IL 


RELATIONSHIPS BETWEEN 
SECURITIES AND PORTFOLIOS 


Cuapter II 


AVERAGES AND EXPECTED VALUES 


MATHEMATICS AND You 


The relationships between securities and portfolios, to be discussed, are 
Mathematical in nature. They follow from definitions of terms and prop- 
erties of numbers, Like the theorems of geometry, they are subject to 
precise statement and rigorous deduction. 

Except for the appendices, this monograph was written to meet the 
feeds of the reader without mathematical training. The writer has 
alempted to illustrate concepts concretely, to avoid excessively terse 
proofs, to introduce essential mathematical -apparatus in easy stages. 
Successive chapters build on previously presented concepts, relationships, 
and apparatus, thus allowing the reader to raise his level of mathematical 
sophistication gradually. 

The non-mathematician cannot expect to skim this monograph as if it 
were a novel, or skip around in it as if it were a newspaper. The subject 
progresses step by step. The journey can be completed only if each step 
is taken in turn. Here are four rules which should aid the reader: 


|. Avoid “good” reading habits. Some modern reading methods en- 
“ourage the reader to grasp phrases in a glance, move steadily forward, 
jwever reread a passage, never mull over a detail. Although such practices 
May be excellent for quickly reading a novel, they are not suited to the 
fomprehension of unfamiliar mathematical material! Rapid reading 
becomes increasingly out of the question as we introduce more compact 
fotation. A few symbols can represent dozens of words of ordinary 
Fnglish. To attempt to swallow such a concentrated morsel in a single 
Wuip is bound to lead to intellectual indigestion. 

2. Pay particular attention to definitions. 11 is impossible for the reader 
1 understand the significance of a theorem or follow a proof if he does not 
How the exact meaning of terms. Terms of special importance are in 
iiulic type when first introduced. 


a7 
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3. Pay particular attention to theorems. A theorem is a compact, 
formal statement of an important relationship between concepts. Most of 
our discussions are directly or indirectly related to some theorem. They 
explain theorems, prove theorems, illustrate the importance of theorems. 
If the theorems are understood, their applications to problems of portfolio 
selection follow as corollaries. 

4. Take time to understand proofs. A proof shows that the relationships 
expressed in a theorem follow from definitions of terms and properties of 
numbers. A theorem learned by rote will soon loose meaning and slip 
from memory. Once the reasons for the validity of the theorem are seen, 
once its proof js understood, once the inevitability and logical necessity of 
the relationship are comprehended, the theorem becomes like an old 
friend not easily forgotten, quick to be recognized if met again. 


THREE TOPICS FOR THE PRICE OF ONE 
We shall conduct parallel discussions on three distinct topics: 


first, we shall consider relationships among past returns, particularly 
relationships between past returns on securities and those on portfolios; 

second, we shall consider relationships among probability beliefs, 
particularly the probability beliefs of securities and their implications for 
portfolios; and 

third, we shall consider relationships among random variables. 


The relevance of the first two topics was illustrated in the last chapter. 
The nature and significance of the third topic require a word of explanation. 

Arandom variable, as we shall use the term, is a number generated by 
achance device. Thus the number of dots appearing after a roll of dice, 
the number of aces drawn in a hand of cards, the number resulting from 
a spin of a roulette wheel are random variables. 

As noted in the last chapter and discussed in Chapter XII, the relation- 
ships which apply to the objective probabilities of random variables also 
apply to consistent probability beliefs. It will be convenient to illustrate 
definitions and relationships with specific random devices rather than 
immediately discussing them in terms of the more abstract notion of 
probability beliefs. Applications to random variables will frequently 
serve as concrete illustrations of relationships applicable to probability 
beliefs, and to past series as well. 

The three topics—relationships among past averages, relationships 
based on probability beliefs, and relationships among random variables— 
are different in substance but (in certain respects) are identical in form. 
A past frequency is not the same thing as a belief about the future. A 
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probability belief about the future is not necessarily a true, objective 
probability, Nevertheless, certain arithmetic relationships among past 
fiverages are exactly the same as corresponding relationships among 
fandom variables. The latter, in turn, are the same as relationships 
‘among probability beliefs. Thus a discussion of one of our topics can 
frequently serve as a discussion of all three. 


FREQUENCY DisTRIBUTIONS, PROBABILITY DISTRIBUTIONS 


A spin of the “wheel of fortune" in Figure 1 determines the next value 
of a random variable. The numbers on the perimeter of the wheel may 
he thought of as amounts won or lost if $1 is wagered. Thus, if the wheel 


Figure 1. Wheel of fortune. A spin of the wheel determines the next value of a 
fandom variable. .05 represents a 5% gain; —.1O-represents a 10% loss, 


Hops with the pointer at .10, the bettor wins 10 cents per dollar bet: he 
jus a 10% return on his money. If the wheel stops with the pointer at 
05, the bettor loses 5 cents per dollar bet: he incurs a 5% loss. 
The significant facts about the wheel can be summarized in tabular or 
faphical form. Table | shows. 


the outcomes which can result from a spin of the wheel, 
the number of times each possible outcome appears on the wheel, and 
the probability that the outcome will result from the spin of the wheel, 


We assume that each place on the wheel has the same chance of ‘appearing 
Wuler the pointer. Thus the outcome .05, which appears in 3 of the & 
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Figures 2 and 3 and Table 1 tell us how probability is distributed among 
Possible outcomes. Each presents the probability distribution of the 
fandom variable. 


places on the wheel, has 3 chances in 8, that is, a probability of 3/8, of 
appearing as the result of the spin of the wheel. 


Taste 1 H 
PROBABILITIES OF OUTCOMES h i 
7 Number of Se eee 
Possible se = 10 05 00 05 lo 
Outcomes | APpearance | Probability 
on Wheel Figure 3. Probabilities of outcomes. This is an alternative way of presenting the 
‘Outcomes on the wheel in Figure 1. (See also Figure 2.) 
—.10 1 1/8 
7 ; i For our purposes, a probability distribution contains all pertinent 
05 3 38 information concerning a random variable. The color and size of the 
10 1 1/8 Wheel are not relevant. Neither is the placement of outcomes around the 
f wheel, as long as the wheel is perfectly balanced and impeccably honest. 
Total 8 1,00 


Taste 2 
PROBABILITY BELIEFS 


This table presents the probability distribution 
of the random variable of the wheel in Figure 1. 


The information in Table 1 is presented graphically in Figures 2 and 3. Return Probability 

In both figures possible outcomes are represented by asterisks (*). The 

=. 1/30 

+ — “a | i 

-10 = 05 r 0 es 

Figure 2. Probabilities of outcomes, Each asterisk represents an outcome on the 10 8/30 

wheel in Figure 1. 20 5/30 

: 0 2/30 

figures differ in their treatment of outcomes appearing more than once on 40 1/30 

the wheel. Thus the outcome .05 is represented like this 50 1/30 


! 
ce 


05 


Tabular presentation of (hypo- 

thetical) probability beliefs, 

in Figure 2, and like this 
Several random devices can have the same probability distribution, 

‘Our discussions will be concerned with probability distributions rather 

‘han with physical mechanisms by which random variables are produced. 
Wheels such as that in Figure | will, nevertheless, be valuable in helping 

W concretely to visualize abstract relationships. In this and following 


in Figure 3. *)\wpters we shall see wheels with random variables, wheels with pairs of 
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random variables, wheels with several random variables, new wheels 
constructed in specific ways from old wheels, and others. Each will 
illustrate an abstract idea relevant to the analysis of portfolios. 


Probability beliefs about future, uncertain events, like probabilities of a 
random event, can be presented in a tabular or graphic form. An indi- 
vidual might feel that the return (including capital gains) on a security 
during the next year will be somewhere between —30% and +50%. The 
probability beliefs which he associates with various possible outcomes 
might be as represented in Table 2. Thus, according to the table, the 
individual feels that there are 4 chances in 30 that a loss of 10% will be 
incurred, whereas he feels there are 8 chances in 30 that a gain of 10% will 
occur. 


Probability 


sr ti 
00 102005. 
Return 


Figure 4. A distribution of probability beliefs. Graphical representation of the 
probability distribution in Table 2. 


The information in Table 2 is presented graphically in Figure 4, Figure 
4 is patterned after Figure 3. Table 2 could also be represented, perhaps 
not so conveniently, by a figure patterned after Figure 2. 


Frequency 


ie ie aT leew al 
-30 -20 -10 0 f.10 30 
Figure 5. Returns on American Tobacco Comm. Stock. Graphical representation 
of past retums in Table 3. 
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TABLE 3 


RELATIVE FREQUENCY OF THE RETURNS ON 
AMERICAN TOBACCO COMMON STOCK, 1937-54 


10) (2) Q) 
Relative 
ae) Frequenc) Frequency 
(by intervals) | ODO |e, 
—.35 to —.25 2 2/18 
= to —15 0 o/s 
=.15 to —.05 4 ais 
=05to 05 4 4/18 
Oto 1S 2 2/18 
Sto 25 2 2/18 
to 35 1 a/18 
to 45 2 2/8 
ASto 5S 1 Wg 
SUM = 18 SUM =1 


Vrequencies of past occurrence can also be summarized in a tabular or 
Waphical manner. Table 3 presents, for example, the following informa- 
Mn concerning American Tobacco Common Stock: 


¢voh annual return which occurred during an 18-year period, 
the frequency with which each return occurred, and 
the relative frequency of occurrence (= frequency divided by 18). 


‘The same information is presented graphically in Figure 5. 


Avihact AND Exrectep VaLue DEFINED 
Ih this section we shall define 


(1) the average value of a past event, 
(2) the expected value of a random variable, and 
()) the expected value of an uncertain event. 


We shall also show how and why a discussion of one of these concepts can, 
1) tain matters, serve as a discussion for the other two. 
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The average value of a series of numbers—for example, the average past 
return of a portfolio—was defined in Chapter I as the sum of the series 
divided by the number of items in the series. 

Average past return may also be defined in terms of relative frequency 
of occurrence, as is illustrated by Table 4. The first column of the table 


TABLE 4 


AVERAGE RETURN OF AMERICAN TOBACCO COMMON 
STOCK IN TERMS OF RELATIVE FREQUENCIES 


0) 
Ree branes ie , 
(midpoint) ‘fiom Table 3) (1) times (2) 
of Interval 
=.30 2/18 — 60/18 
-.20 of18 na 
=.10 4/18 — A018 
00 4/18 oe 
+.10 2/18 20/18 
+.20 218 40/18 
+.00 1/8 30/18 
+40 218 80/18 
+.50 1/18 50/18 
SUM = Average 
SUM =1 Past Return 
= 1.20/18 = .067 


lists returns which occurred during 18 years. The second column presents 
the relative frequencies of occurrence. The third column is 


(the value of the return) times (its relative frequency). 


That is, the third column is the product of the first and second columns, 
The sum of the entries in the third column is the average past return. 
The asterisks in Figure 5 indicate the past returns on American Tobacco 
Common. The arrow marks the average of these returns. Figures Sa 
through Sh show the past returns and their averages for eight other 
securities used in Chapter I. These illustrations should provide an 
intuitive feel for the relationships between typical series and their averages. 
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A spin of the wheel in Figure | determines the next value of a random 
Variable, The expecied value of this random variable is, by definition, the 
Average of the numbers on the wheel. The expected value of the random 
ible in Figure 1, then, equals 
05 + 00 + 05 — .05 + .05 + .10 + .00 — 10 
8 


= 0125. 


Hxpected value can also be calculated in terms of probabilities of outcomes. 
I Table 5, the first column lists possible outcomes; the second column 
presents probabilities of outcomes; the third column is 


(the value of the outcome) times (its probability). 


The sum of the entries in the third column is the expected value of the 
fandom variable, 


TaBLe 5 
DERIVATION OF EXPECTED VALUE FROM PROBABILITIES 


a Q) @) 
Outcome Probability (1+ Q) 
=.10 1/8 
—05 1/8 
00 28 
05 38 
10 1/8 
SUM 1,00 10/8 = .0125 


One must distinguish between an average outcome and an expected 
fleome. A spin of the wheel in Figure | will result in either a —.10, or 
#05, or a .00, or a .05, ora 10. Thus the average outcome of a spin 
{ihe result of the spin divided by 1) will be one of the five aforementioned 
fwmbers. The expected value of the spin, on the other hand, is the average 
‘of the numbers on the wheel. If we keep these distinctions in mind, we see 
that the following are meaningful statements: 


the random variable turned out to be greater than its expected value; 
the random variable turned out to be less than its expected value; 


ott 


Frequency 


Frequency 
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Figure Sa. American Telephone & Telegraph. 
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Figure Sb, United States Steel. 
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Figore Sc. General Motors. 
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Figure Sd. Atchison, Topeka & Sante Fe. 
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Figure Sh. Sharon Steel. 


48 PORTFOLIO SELECTION 


the average of the ten spins was very close to the expected value of the 
random variable 


The reader should keep in mind the exact sense in which we use the term 
“expected value.” Expected value is only imperfectly related to the verb 
“to expect.” The expected value of the random variable generated by the 
wheel in Figure | is .0125, Yetno one would expect this to be the outcome 
since .0125 does not appear on the wheel. Expected outcome, as we use 
the term, is no more or less than an average. Strictly speaking, it is, as 
illustrated in Table 5, the weighted average of possible outcomes with 
probabilities of occurrence used as weights. 

Our definition of expected outcome raises two questions: 


1, Why attach the term “expected outcome” to the concept “a weighted 
average of possible outcomes using probabilities as weights”? 
2. Why use this concept at all? 


The answer to the first question is “convenience and tradition.” We shall 
use the concept frequently; we need a label for it. Since the eighteenth 
century, statistical literature has associated this term with this concept. 

The second question, why use this concept at all, will be discussed in the 
next section, “Alternative Measures of Central Tendency.” 


The expected outcome of an uncertain event is defined exactly like the 
expected outcome of a random variable, except that probability beliefs are 
used instead of objective probabilities. If the second column of Table 4 
represented an individual's probability beliefs concerning possible returns 
on a security, then the sum of the entries in the third column is what we 
refer to as his expectation, or expected return, for the security. 

The first column in Table 6 lists possible outcomes. The second 
column contains a set of numbers which sum to one. The third column 
is the product of the first two. Its sum is written below the column. 

The table could represent the probability distribution of a random 
variable, the frequency distribution of a past event, or a set of probability 
beliefs. It is impossible to tell which. Depending on what kind of 
distribution is represented by the second column, the sum of the third 
column is the expected value of a random variable, the expected value of 
an uncertain event, or the average outcome of a past event. 

From an arithmetic point of view, past averages and the two types of 
expected values are identical. Each is defined, in exactly the same way, 
in terms of a set of numbers such as that in Figure 5. A discussion of the 
arithmetic relationships governing any one of the three can be casily 
translated into a discussion of either of the other wo. 
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TasLe 6 
WHAT KIND OF DISTRIBUTION IS THIS? 


Q) | Q) ()-Q) 
=.10 ds 01 
—.05 x —.01 
00 3 00 
+.05 2, +01 
+.10 ct +01 
+.15 al +015 
+.015 


Attimnative Measures OF CENTRAL TENDENCY 


“Averages and expected values measure central tendency: they tell us 
Where the center of a distribution is located. An average past return is a 
fwinber about which past returns fluctuated. An expected future return 
1} centrally located with respect to possible future outcomes. 

Average and expected value are not the only measures of central 
Wndency. We shall compare these measures with two others frequently 
Hind in statistics texts. 

Table 7 describes three measures of central tendency. For each it 
ie 


{4) its technical name, 

(b) an alternative name, appropriate when the measure is applied to past 
win, 

(0) an alternative name, appropriate when the measure is applied to 
fandom variables or probability beliefs, and 

{W) a brief description of the measure. 


Thus the mean is the technical name for an ordinary average. The 
{enn ofa past series is called its average. The mean value of a probability 
‘Weibution is called the expected value of the random variable or uncertain 
Pi) The mean, as we already know from our discussion of expected 
‘whet and averages, is a weighted average of possible outcomes with 
frobwbilities or frequencies used as weights. 


50 PORTFOLIO SELECTION 


TaBLe 7 
THREE MEASURES OF CENTRAL TENDENCY 


Alternative 


Alternative Name, 

Technical Name, random 

Name past series variables Description 

Mean Average Expected value A weighted average using 
frequencies or probabilities 
as weights 

Mode Most frequent Most likely That number (or those 
numbers) which appears 
more frequently, or has a 
higher probability than any 
other 


Median Midpoint Midpoint A number such that 50% of 
the observations (or 50% 
of the probability) are 
associated with numbers 
equal to or greater than this 
number, and 50% are 
associated with numbers 
equal to or less than this 
number 


The mode is an outcome which appears more frequently or has a higher 
probability than any other outcome, Thus the mode of the distribution 
in Figure 2 is 0S. The distribution in Table 8 has two modes, since .00 
and .05 have the same, highest, probability (or frequency). 


Tobie 8 


A DISTRIBUTION WITH TWO MODES AND 
MANY MEDIANS 


—.05 6 

.00 1/3 
+.05 3 
+.10 116 


The median is the midpoint of a distribution. At least one-half of the 
probability (or frequency) is associated with outcomes equal to or smaller 
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than the median; at least one-half is associated with outcomes equal to or 
larger than the median. Thus the distribution in Table 2 has a median of 
10, since the sum of the probabilities associated with outcomes equal to 
oF less than .OL is 21/30, while that associated with outcomes equal to or 
freater than .10 is 17/30. In the distribution in Table 8, any number 
hetween .00 and .05 (inclusive) is a median. For example, .025 is a 
median, since the sum of the probabilities associated with outcomes less 
than .025 equals 1/2, as does the sum of the probabilities associated with 
uleomes greater than .025. 

Hefore discussing the relative merits, for portfolio analysis, of aiernative 
Measures of central tendency, let us consider the use made of such measures. 
In Chapter II, you recall, average past return was used as one of two criteria 
for evaluating portfolios. The other criterion was standard deviation, a 
Measure of variability. The average returns and standard deviations of 
Portfolios were plotted in Figure 4 of Chapter II. If a portfolio had an 
fvernge return and standard deviation represented by a point on the curve, 
i! was referred to as efficient. If a portfolio was not efficient, it was 
Pomible to find an efficient portfolio with either 


(4) greater average return but no greater standard deviation, or 
(b) less standard deviation but no less average return. 


Hather than average return we could have used mode or median as a 
Measure of central tendency. Rather than standard deviation we could 
Nave used, for example, “maximum loss” or “frequency of loss” as the 
fasure of instability. We could have constructed a figure with mode on 
the Horizontal axis and maximum loss on the vertical axis. Wecould have 
Hawn a curve showing the smallest maximum loss obtainable among 
forifolios with a given level of mode, We could have defined a portfolio 
W be efficient if its mode and maximum loss were represented by a point 
WY this new curve. 

We must distinguish between 


18) & portfolio deemed efficient in an analysis based on mean and 
Weidard deviation and 
{h) 8 portfolio deemed efficient in an analysis based on mode and 
simum loss. 


Tie first we will refer to by the clumsy but mnemonic phrase: a mean— 
Word deviation—efficient portfolio. The second we will refer to as a 
Wl-maximum loss—efficient portfolio. We can similarly define a 
“Wollin—standard deviation—efficient portfolio,” a “‘mean—probability 
© loyefficient portfolio,” or a portfolio indicated as efficient in an 
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analysis based on any measure of central tendency and any measure of risk, 
variability, or uncertainty. 

One measure of central tendency is better than another if it generates 
better efficient portfolios. If the best “‘mode—standard deviation— 
efficient portfolio" is generally better than: the best “mean—probability 
of loss—efficient portfolio,” the best “median—standard deviation— 
efficient portfolio,” etc., then mode is the best measure of central tendency 
and standard deviation is the best measure of instability. 

In some cases several different combinations of measures lead to the same 
set of efficient portfolios. This is true, in particular, when probability 
distributions of returns can be described by one of several standard 
patterns. If, for example, probability distributions are assumed to 
follow the (bell-shaped) normal distribution, illustrated in Figure 6a, then 
a portfolio is mean—standard deviation—efficient if and only if it is 


mode—probability of loss efficient, 
mode—standard deviation efficient, 
mean—probability of loss efficient, 
median—standard deviation efficient, and 
median—probability of loss efficient. 


The above equivalence between efficient sets also holds if probability 
distributions follow the “log-normal”’ distribution illustrated in Figure 6b. 

In cases such as these the choice of measures is primarily a matter of 
convenience. For example, in the analysis based on probability beliefs, 
discussed in Chapter IJ, it was assumed that probability beliefs approxi- 
mately followed the log-normal pattern. The questionnaire was phrased 
in terms of mode, that is, “‘most likely return.” For convenience of 
computation, mode was translated into mean. For ease of interpretation, 
mean was translated back into mode. 

It may happen, however, that different combinations of measures yield 
different efficient portfolios. For such cases we must seek a combination 
of measures, a measure of central tendency and a measure of instability, 
which generally produces better efficient portfolios than any other 
combination of measures. 

‘A comparison of measures can be based either on specific instances of 
on general principles. We could show cases in which one measure clearly 
did better than another in describing the relative worth of two portfolios. 
We could show clearly undesirable properties possessed by one measure 
and not by another. From such particulars we might gain a feeling about 
at least some of the strong and weak points of different measures. Or, 
alternatively, we could deduce the relative merits of different measures 
from fundamental principles of choice in the face of risk and uncertainty, 
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‘The two approaches, one based on specific instances, the other based on 
feneral principles, are supplementary. Instances give concreteness to 
Principles; principles provide generality to instances. Eventually both 
‘approaches will be taken in this book. 
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Figure 6. (a) The “normal” distribution; (b) the “log-normal” distribution. 


‘We are not ready to explore principles of behavior under uncertainty. 
‘NWF present discussion will be confined to particulars rather than uni- 
Wreely, Our final discussions of behavior under uncertainty will be 
Joulponed to Chapters X through XIII. 


Figure 7 presents two distributions, (a) and (b). Distribution (b) may 
1 thought of as a slight modification of distribution (a): (b) results if the 
~ W relurn in (a) is increased to a —.15 return and one of the .20 returns 
1 (8) is increased to a .25 return. Distribution (b) is clearly better than 
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distribution (a), since it can be obtained by increasing some returns while 
decreasing none. 
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Figure 7, Two (hypothetical) distributions of returns. 


Yet distribution (2) has a mode of +.20, while distribution (b) has a 
mode of —.15. Thus we see that a small difference between distributions 
can make a large difference in their modes. More important, this differ- 
ence in mode can be completely misleading. 

A third distribution of returns is presented in Figure 8. This third 
distribution may be thought of as a modification of distribution (b) in 
Figure 7. The distribution in Figure 8 results if the .10 return in (b) is 
increased to a .20 return, the .20 return in (b) increased to a .40 return, and 
the .25 return increased to a.45 return. Clearly the distribution in Figure 
8 is superior to distribution (b). 
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Figure 8, A third distribution of returns. 


Yet the median of the distribution in Figure 8 is the same as the median 
of distribution (b). In fact, both of these are equal to the median of 
distribution (a). Thus we see that the median is insensitive to changes, 
even to unambiguously good or bad changes, as long as they do not affect 
the midpoint of the distribution. 
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An increase in returns, as between distribution (a) and distribution (b) 
in Figure 7, is always associated with an increase in mean. If a single 
‘silerisk is shifted to the right, the increase in mean equals 

the amount by which the asterisk is shifted 
the number of asterisks in the distribution’ 


Hence mean cannot move in the wrong direction, as the mode could; nor 
‘an the mean be insensitive to an increase not offset by any decrease, as 
ould the median and the mode. 

It would seem, then, that the mean is a more appropriate measure of 
sentral tendency than is either mode or median. Further substantiation 
of the appropriateness of the mean is presented in Chapter XIII. 

Hesides being a more desirable measure of central tendency, the mean is 
# More convenient measure in at least two respects. First, a distribution 
fii Never have more than one mean. It can, however, have many modes 
nd medians (see Table 8). Second, the arithmetic relationships between 
Means of securities and means of portfolios are much simpler than the 
‘orresponding relationships for modes and medians, 

‘The rest of this chapter will discuss these relationships between means of 
Weurities and means of portfolios. 


A ConsTANT TIMES A RANDOM VARIABLE 


The wheel in Figure 9 generates a pair of random variables. The 
‘ihbers on the outer ring of the wheel are exactly the same as those on the 
wheel in Figure 1. The numbers on the inner ring are, in every case, 
thee times as large as the numbers on the outer ring. The random number 
sherated by a spin of the wheel and a reading of the inner ring is inevitably 
ther times as large as the random number generated by the same spin and 
# feiding of the outer ring. 

For ease of reference we shall name our two random variables. That on 
the outer ring we shall refer to as the random variable r. That on the 
Wer ring we shall refer to as the random variable 3r. Their names 
find us of the relationship between them. 

The expected value of the random variable 3r can be computed in the 
Wal way; 


15 + 00 + 1S — 15 + .15 + .30 + 00 — .30 
g 
Ha when we know the expected value of a random variable such as r, 


{Wie is an easier way to obtain the expected value of a random variable 
fw ws Sr, The expected value of the random variable 3r is three times 


= .0375. 
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the expected value of the random variable r (.0375 = 3 times .0125). 
More generally, for any random variable on any ring of any wheel: 


a random variable which is always three times as large has an expected 
value three times as large; 

a random variable which is always 1000 times as large has an expected 
value 1000 times as large; 

a random variable which is always one-half as large has an expected 
value one-half as large; 

a random variable which always equals (—2) times the original random 
variable has an expected value twice the magnitude of, but opposite in 
sign to, the original random variable. 


In practice, r might be the return on a $1 bet; then 3r would be the 
return ona $3bet. Orr might be the return per share of a security; then 


Figure 9. Random variables: r and 3r, The outer ring determines a random vari- 
able r. The inner ring determines a random variable 3r. The expected value of 3r 
is three times the expected value of r. 


1000r would be the return on 1000 shares. Orr might be the return per 
dollar of portfolio if the entire portfolio were invested in security A; then 
one-half r would be the return from security A, per dollar invested in the 
portfolio, if one-half the value of the portfolio were invested in security A. 
Or r might be the amount which the roulette croupier gains from, or loses 
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10, the player if $1 is bet on “red”; then —2r is the amount which the 
player wins from or loses to the croupier on a $2 bet on “r 
Ifa random variable r had an expected value of .01, then 


the expected value of 3r would be .03, 

the expected value of 1000r would be 10.00, 
the expected value of .5r would be .005, 

the expected value of ~2r would be ~.02. 


‘The gencral relationship is stated in the following theorem. 


‘Thwonem: Suppose r is any random variable generated by any wheel; 
suppose 4 is any number; 
Suppose Ar is the random variable which is always A times as large as r; then 
the expected value of Ar is A times as great as the expected value of r. 
The proof of this theorem involves only definitions of terms and well-known, 
felationships of arithmetic: Suppose that r is on the outer band and Ar is 
4 the inner band of some wheel. By definition, the expected value of r 
equals 
the sum of the numbers on the outer band 
the number of places on the wheel 


Similarly, the expected value of Ar is, by definition, 


the sum of the numbers on the inner band 
the number of places on the wheel 


fyery number on the inner ring is A times as great as the corresponding, 
fwinber on the outer ring; hence, by a basic property of numbers, the sum 
Of the numbers on the inner ring is A times the sum of the numbers on the 
Wler ring. That is, 


the sum of the inner ring = A times (the sum of the outer ring). 


Dividing both sides of this equation by the same number, we obtain the 
few equality: 
the sum of the inner ring _ 


the number of places anes 
on the wheel 


the sum of the outer ring 


the number of places 
on the wheel 


Fipally, substituting definitions, we find that 
jhe expected value of Ar = A times (the expected value of r). Q.E.D. 


The theorem stated and proved for the random variables of our wheels 
11 he restated to apply to past series and uncertain events. Specifically: 
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(1) if past series 1 is always A times as large as past series 2, then the 
average value of past series 2 is A times as great as the average value of 
past series 1; and, 

(2) if it is believed that future outcome | must inevitably be A times as 
great as future outcome 2, then the expected value of future outcome 2 
must be A times as great as the expected value of future outcome 1. 


Because of the formal identity among the subjects discussed, no special 
proof is needed for the last two assertions. 
RANDoM Pairs 


A spin of the wheel in Figure 10 generates a random pair of variables. 
For example, if the wheel stops as in the figure, the result of the spin is 
the pair (.05, 10). The random variable on the outer ring will be referred 


Figure 10, Random pairs of variables. 


toasg. That on the inner ring will be referred to as the random variable r. 
Thus a spin of the wheel determines a value of g and a value of r: it 
determines a pair (9, r). 

The probabilities associated with different pairs can be presented 
graphically. Each pair of numbers (,r) on the wheel in Figure 10 is 
represented by an asterisk in Figure 11. The horizontal axis gives the 
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4 Value of the pair; the vertical axis gives the r value of the pair. Pairs 
{y,r) which appear more than once are represented by two or more 
Adjacent asterisks. 


10 . 
05 . 
i ‘ * aly 1 
=10 =05 05 109 
: . = 05 
. =.10 


Figure 11, Possible pairs of outcomes. 


The asterisks in Figure 12 represent the pairs of numbers on the wheel 
Ww Figure 9. Since each number on the inner ring is exactly three times 
that on the outer ring, the asterisks in Figure 12 lie in a straight line. Ina 


r= = 15 =.10 =.05 nr 


Figure 12, Possible pairs of outcomes. 
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The probabilities of various possible pairs of outcomes in this case are 
fepresented by asterisks in Figure 14. Random variables such as q and r 
of Figure 14 are said to be independent. Knowledge of the ¢ outcome 
fives no hint as to the outcome of r. 


case such as this we say that the two random variables are perfectly 
correlated 

In Figure 1! the g values and r values show some tendency to be high 
and low together. This tendency is not perfect, however—the asterisks do. 
not lie ina straight line. We therefore say that the variables are correlated 
but not perfectly correlated, 


Figure 14, Possible pairs of outcomes. 


‘The information in Figures 10 through {4 can also be presented in a 
{able. Table 9 illustrates one method of presentation. Theg and r values 


TaBLe 9 


PROBABILITIES OF PAIRS— 
A TABULAR PRESENTATION OF FIGURE 11 


q r Probability 
Wheel 2 Wheel 3 
=.10 =.10 wiz 
O) ~.10 ~.05 irs 
: , a =.05 05 2 
Figure 13. Two ways of generating (g, r) with the same probabilities. re rs a2 
00 00 wiz 
Different chance mechanisms can produce random pairs with the same 05 00 212 
probabilities. The probability of obtaining a pair (g, r) is the same, for .05 410 wiz 
example, whether we 10 0s yi2 
10 410 Wi2 


(a) spin wheel | in Figure 13 and let be the number on the outer wheel, 
and r be the number on the inner wheel; or 

(© spin wheel 2 in Figure 13 and let its outcome be g; then spin wheel 3 
in the same figure and let its outcome be r. 


‘of each pair are listed in the first two columns; the probability of the pair 
i piven in the third column. Table 10 illustrates an alternative method of 


2 


presentation. Values ofg are listed along the side of the table; values ofr 
are listed across the top of the table. The probabilities associated with 
cach pair are listed within the table. Thus the probability of the pair with 
ag of .05 and an r of .10 is presented in the cell whose row is marked .05 
and whose column is marked .10. The method of presentation of Table 
10 is generally more compact and gives a better visual impression of the 
probability distribution than does the method of presentation used in 


Table 9. 


TABULAR PRESENTATIONS OF FIGURES 11, 12, AND 14 
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PROBABILITIES OF PAIRS— 
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Taste 10 (Continued) 
© 


ty 

00 wiz wiz 1/12, 
05 2/12 a2 212 
10 1/2 V2 Vi2 


(a) 
r ‘Th ExpecreD VALUE OF THE SUM OF Two RANDOM VARIABLES 
4 The wheel in Figure 15 illustrates the notion of a random variable which 
=10 —05 0 05 10 it the sum of two other random variables. The first and second rings of 
—.10 12 Wi 
-.05 Wi2 1t2 
00 12 
0s 42 ald 
10 Wiz 412 
(b) 
3r 
r 
-.30 -15 00 | lS 30 
=.10 1/8 
=.05 18 
oo 14 ap Figure 15. The sum of two random variables. 


1/8 
B ie wheel in Figure 15 are the same as the two rings on the wheel in 


Pipure 10. The numbers on the third ring in Figure 15 are, in every case, 
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the sum of the numbers on the first and second rings. Thus the random 
number generated by a spin of the wheel and a reading of the third ring is 
inevitably the sum of the numbers generated by the readings of the first 
and second rings. 

We shall continue to refer to the random variables generated by the first 
and second rings as q and r, respectively. We shall refer to the random 
variable generated by the third ring either as s orasr+q. s=q+r 
expresses the fact that s andg + rare different names for the same entity. 

In practice, q might be the return if $1 were bet on ring 1; r might be the 
return if SI were bet on ring 2. Then r + q would be the total return if 
$1 each were bet on both rings. 

The expected value of the random variable q is 


05 + .05 ~ 05+ .05+ .10+.10 +.05— .05—.10 —.10 + .05 + .00 EN AS 
12 12 


The expected value of the random variable r is 


10 + .00 ~ 05 + 104 .10 + .05 + 00+ 00 — 05 —.10+.10+.00 _ .25, 
12 12 


The expected value of the random variable s is 


AS + .05 — 10+ .15+ 20+ .15+.05—.05 = .15—.20+.15-+.00 _ 40 
12 12 


Thus 
Is | 25 _ 40 
iy 12 


Hence, in this case at least, the expected value of r + q equals the expecte 
value of r plus the expected value of g. This relationship is true for am 
sum of two random variables on any wheel, as is expressed by the followin, 


THEOREM: Suppose that g and r are a pair of random variables generated 
any wheel. Suppose s is the random variable which always equals the sum 
q andr, Then the expected value of s equals the expected value of ¢ plus t! 
expected value of r. 


Note that this theorem is asserted for any pair of random variables 
whether they are correlated or uncorrelated. 

Proor: We shall suppose that we have any two random variables, g and 
on the first and second rings of a wheel, We shall suppose also that the thi 
ing has the sum, s, of the first two rings. a 

Let us first show that the sum of all entries on the third ring equals (the sul 
of the entries on the first ring) plus (the sum of the entries on the second rin, 
If we let a, represent the first number on the first ring, a, the second number 
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the first ring, a, the third number on the first ring, a, the fourth number on the 
fat ring, and so on, until a, the last number on the first ring—then the sum of 
ihe numbers on the first ring is a, + a. +a; +a,+°**+ aq, If we let 
Hpresent the first number on the second ring, bp the second number on the 
soond ring, bs the third number on the second ring, b, the fourth number on 
ihe second ring, and so on, until ,, the last number on the second ring—then 
the sum of the numbers on the second ring is by + by + by + by + ** + By. 
‘The first number on the third ring is, by definition, a, + 6,; the second number 
oh the third ring is a, + b,; the third number on the third ring is oy + by; the 
fourth number on the third ring is a, + b,; and so on, until the last number on 
th third ring, a, + 6,. The sum of the numbers on the third ring is 
(ay + by) + (ag + by) + (ay + by) + (ay + bg) + °° + (aq + ba), 

which in turn equals 

May + a + ay + agt' + +44) + (6, + bp + by + byt: + > +5,), 


Thos 
(the sum of the numbers on the third ring) equals 
(the sum of the numbers on the first ring) plus 
(the sum of the numbers on the second ring). 
Dividing both sides of the above equality by the number of places on the wheel, 
ee find that 
the sum of the numbers on the third ring 
the number of places on the wheel 
the sum of the numbers on the first ring 
the number of places on the wheel 
the sum of the numbers on the second ring 
the number of places on the wheel 


equals 


Hs itis, in other terms, says that, 


the expected value of r + g equals 
the expected value of r plus 
the expected value of g. QED. 


Th Wxpeeteo Vatut oF « Weicitep Sum of Two RANDOM VARIABLES 


The wheel in Figure 16 illustrates the notion of a weighted sum of two 
‘ewlom variables. The first two rings generate random variables which 
‘Pe shall refer to asg and r, respectively. The third ring generates a random 
‘einble equal to 3g; the fourth generates a random variable equal to 2r. 
The Qumbers on the fifth ring are the sum of the numbers on the third and 
Jouih sings: Thus the fifth ring generates the random variable w = 

4% Ifqis the return from SI bet on ring 1 and ris the return from 

fel on ring 2, then wis the total return from $3 bet on ring 1 and $2 bet 
wing 2. 
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If the reader cares to compute the expected values of g, r, and w, he will 
find that the expected value of w: equals three times the expected value of 


Figure 16. Weighted sum of two random variables. Ring $ = w = 3y + 2r. 


q plus twice the expected value of r. This relationship between the ex} 
tations of two random variables and the expectation of their weighted su 
holds generally. 


THEOREM: Suppose that q and r are any pair of random variables generat 
by any wheel; suppose that 4 and B are any numbers; suppose, finally, that 
is the random variable which is always equal to 


Aq + Br. 
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Then the expected value of w equals A times the expected value of g plus B times 
ihe expected value of r, 

Proor: This theorem follows easily from two results previously established, 
‘The result concerning the ordinary sum of a pair of random variables tells us 
hat the expected value of Aq + Br equals the expected value of Ag plus the 
#apected value of Br. The result concerning a constant times a random variable 
“Nt {hat the expected value of 4q equals A times the expected value of g, 
anil the expected value of Br equals B times the expected value r. Combining 
these (wo results, we derive the theorem of the present section: The expected 
ph of Aq + Br equals A times the expected value of g plus b times the expected 
value of r. 


The theorem of the present section also applies, with slight rephrasing, 
1 past series and probability beliefs. For example, if security | has had 
4) average return of 12% and security II has had an average return of 8%, 
then the average return per dollar invested ina portfolio consisting of 1/4 
Weurity Land 3/4 security 1] was 


(1/4)(12) + (3/4)(8) = 9%. 


Tit WriGhteD AVERAGE OF ANY NUMBER OF RANDOM VARIABLES 


Mevious sections presented results applicable 10 one or two random 
‘eliubles. ‘The present section presents # more general result applicable 
1 aly number of random variables, 

It is necessary to modify our notation somewhat before discussing 
‘hilrarily large numbers of random variables. In discussing small groups 
W/ then we can refer to each by name: Joe, Ben, Charlie. In discussing 
WUE" groups we frequently use a different method of reference. We speak 
the first man, the second man, the first two hundred men, Thus far it has 
He" Convenient to give random variables names such as q andr. Now it 
Hill be convenient to refer to random variables as r,, ry, ry etc. 1, (read 
/Yub-| or simply 7-1) is, literally, the first random variable; rg is the second 
fetilom variable; 44, is the 147th random variable, If N represents the 
{tal number of random variables, then ry is the last random variable. 

nilarly, rather than speaking of numbers A and B, we shall speak of a 
Wl Humber 4,, a second number 4,, a 147th number Ayy>, and a last 
Huber Ay, 

The wheel in Figure 17 illustrates the notion of a weighted sum of several 
‘yelom variables. The random variables on the first four rings will be 
Wleited to as ry, rr and ry respectively. The random variable on the 
HH Hingis always 


(2 times r,) plus (3 times r,) plus (r,) plus (two times r,). 
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The above relationship is a special case of the following: 


‘Twwonem: Suppose that r,, rs, 73° * “Fy, is any set of N random variables 
ponerated by some wheel, Suppose that A,, 4,, Ay," * “Ay is any set of N 
fwmbers. Suppose that w is the random variable which is inevitably equal to 
Ait Ava + Aygy++ ++ Ayry. Then the expected value of w is 


(A; times the expected value of r,) plus 
(Ay times the expected value of r,) plus 
(Ay times the expected value of r,) plus 
(and so on, until) 

(Ay times the expected value of ry). 


If all the A’s equal 1 in the above theorem, we have the case of an 
Wdinary sum. Hence, if we prove the above theorem—that the expected 
*hlue of the weighted sum is the weighted sum of the expected values—we 
Nave also proved that the expected value of an ordinary sumis the ordinary 
§m of the expected values. For example, the expected value of r, + r + 
#) Muals the expected value of r, plus the expected value of r, plus the 
#apected value of ry. 

le have already proved special cases of the general theorem. When 
N= | the general theorem says that the expected value of Ayr, equals A, 
limes the expected value of r,. This case was shown in the section on 
"A Constant Times a Random Variable.” When N = 2 the theorem says 
tht the expected value of A,r, + Agr, equals A, times the expected value 
ofr, plus 4p times the expected value of r». This case was shown in the 
‘Wellon on “The Weighted Sum of Two Random Variables.” The fact 
‘Wl we have already established these special cases will be useful in the 
lool of the general theorem. 

The type of proof used for N = 2 could also be used for N = 3 or 4 or 
Ji 6. It would be clumsy, however, for N = 48 or 134. Furthermore, 
‘1 Would not meet our need: to prove the theorem forall values of N. For 
Wis We must adopt a new method of proof. 

The type of proof we shall employ is known as “mathematical induc- 
Wh" We start by noting that the theorem is true when N= 1. We 
‘*PP! prove that, ifthe theorem is true when M variables are involved, then 
1) alyo true when Mf + 1 variables are involved, Hence, the fact that it 
1) Wve for V = 1 implies that it is true for N = 2; the fact that it is true 
ty) N = 2 implies that it is true for N =3; and so on. Thus we have 
‘Won that the theorem is true for any N once we have shown that 


Figure 17, Weighted sum of several random variables. Ring $=w = 2r, +1 
tn tie 


The random variable generated by the inner ring will be referred to as tl 
weighted sum w. 


w= ln ty try t re 
The expected value of w in the present example is 
(twice the expected value of r,) plus 

(three times the expected value of 72) plus {4) the theorem is true for N= 1, and 

(the expected value of r3) plus {h) ifthe theorem is true for N = M then it is true for V = M +1. 


Uice the erece ea yaluelOnra), + have already shown that (a) is so. We must now prove (b). 
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Suppose, then, that the theorem is true for a weighted sum of M 
variables: 
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Avy + Ae t 00 + Ante 
Is it true for the weighted sum of M + 1 variables 
we Ah + Aglp $00 + Aut + Arata? 
A weighted sum of random variables is itself a random variable. 
Specifically, 
Ay + Anta to + Aut 
may be thought of as a single random variable. We may therefore think 
of w as the sum of two random variables, namely 
(Ayr t+ Aare $0 + Auta) + Anata 
The previously established result for the weighted sum of two random 
variables tells us that 
the expected value of »: equals 
the expected value of (Ayr) + Aare + 
Angas times the expected value of ry41- 
But we started by assuming that the expected value of 
Ay + Aare to + Aut 


equals A, times the expected value of r, plus A, times the expected value 
of r;, and so on. Hence the expected value of Ayr, ++ °° + Aagaifana 
equals 


+ Ayr) plus 


A, times the expected value of r; plus 
A, times the expected value of r, plus 


Ay times the expected value of ryy plus 
Ayyas times the expected value of ry44- 


QE, 


Tre Main Resucts 


The theorem of the last section is the main result of the present chap' 
as far as random variables are concerned. It says that the expected valu 
of a weighted sum is the weighted sum of the expected values. This 
true whether the random variables are correlated or uncorrelated. It 
true for any number of random variables, no matter how they move up 
down together. 
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‘The theorem of the last section can be rephrased to apply to past series 
OF to uncertain future events: 


1, Ifa series (S) is the weighted sum of N other series, then the average 
Of series Sis the same weighted sum of the averages of the N series. 

2, If an individual believes that the value (V) of some future event must 
inevitably equal a particular weighted sum of the values of N other future 
fronts, then, if his probability beliefs are internally consistent, the expected 


Value of V must be the weighted average of the expected values of the 
Nevents. 


‘The proof given for the theorem of the last section also applies for the 
‘hove relationships. These two relationships are our main results con- 
‘fPiHing expectations of uncertain future events and averages of past series. 


EXERCISES 


| Calculate the expected value of a toss of an ordinary six-sided die, 


+ Use the result of Exercise 1, plus the theorem on the sum of two random 
Wlables, to calculate the expected vaiue of a toss of a pair of dice. 


} During the years 1934-1954 the following securities had the following 
‘Peerage returns: 


security average return 
(1) American Tobacco, Common 6.6% 

(2) American Tel. and Tel,, Common 6 

(3) United States Steel, Common 14.62 


What were the averages of the returns on the following portfolios? 


Portfolio 1 
security % in portfolio 
1 33.3% 

2 33.3% 

3 333% 
Portfolio II 
security % in portfolio 
1 25% 
2 23% 
3 50% 
ANSWERS 


DH 2: 7; 


: 9.1; 3b: 10.5, 
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feond column. In the first year, for example, the squared deviation from 
the average was .09 times 09 = .0081. The average of the entries in the 
third column, .0034, is referred to as the “variance” of return. 

The variance of return on a past series, then, is the average of the 
Wlared deviations from the average. 

‘The variance of a past series can also be computed from a table of 
Wlative frequencies, Column 1 of Table 2 lists returns which occurred to 
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DERIVATION OF VARIANCE FROM RELATIVE FREQUENC:ES 
_—— 


The definition of the variance of a series is illustrated by Table 1. The Frequencies | 4 Deviations 
: ? ; verage Squares 
first column of the table lists returns on a hypothetical security during 5 
years, The average of these returns is.01. The second column indicates, 

= 0S 2s —.06 0036, 00144 

Tae I 0) 1/5 -0 0001 00002 

0S us 04 0016 00032 

RETURNS AND THEIR VARIANCE 
10 Ws 09 0081 00162 
ee — 
eviations 
Squared JM 1 
Returns from Deviation’ Mu 00340 
Average _ 
“W) hiypothetical security; column 2 shows the relative frequency of each 
10 0081 ‘Whim The average of the series, 01, can be computed from columns 1 
— 0S a 0036 4, as explained in the last chapter. Column 3 lists the difference 
0 fat 0001 eren the return and the average return; column 4 lists the squares of 
05 ‘ 0016 JH Hilries in column 3; column § presents the product of the entries in 
= 25 = D038 wins 2 and 4. The sum of column 5 is the average of the squared 
- jalions from the average. The sum of column §, in other words, is the 
Sum_.05 0 0170 Wlaioe of the series, 
Th slandard deviation is the square root of the variance. In the 
yeverace a Oe peo ee! ‘+eniple under discussion the standard deviation equals V-0034 = .058. 

eRe iy The ‘elationship between returns and their standard deviation is 


Wiaied in Figure 1. The asterisks in Figure 1 indicate annual returns, 
1917 through 1954, on Atchison, Topeka, and Santa Fe Common 
4} (See Chapter II, page 10.) The arrow marked A indicates 
‘Srtfage return. The arrows marked A + 1 St. Dv., A + 2 St. Dv., 
~ |S Dy., and A — 2St, Dv. indicate returns which equal, respectively, 
‘#yetage plus one standard deviation, the average plus two standard 


Standard deviation = V,0034 = .058. 


for each year, the difference between that year's return and the averaj 

return. Thus, in the first year, return was .09 greater than the five-y« 

average. The third column presents the squares of the numbers in 
n 


4 PORTFOLIO SELECTION STANDARD DEVIATIONS AND VARIANCES 15 


deviations, the average minus one standard deviation, and the average fe quite close to two standard deviations from the average. In other 
minus two standard deviations. For example, the arrow marked A + I fhe the relationship is quite rough. With security 8, for example, the 
St. Dv. is at the return 198 4 .357 = .555. The arrow marked A — | lowest return is only about 1.5 standard deviations from the average; with 
St. Dy. is at the return 198 — 357 = —.159. ‘ourity 7 the lowest return is 2.7 standard deviations from the average. 
The standard deviation is a measure of variability which takes more into 

A=2St Dv A-1St. Dv ms A41 St Dv A+25St Dv. ‘#eount than just one or two extreme returns, Figure 2 presents returns 


1 (wo hypothetical securities which have the same average and the same 

| | | | | Highest and lowest returns as the security in Figure 1. They do not, how- 
#00, have the same standard deviation of return. The returns in Figure 2b 

“0 -O-N-D- wh wm (iO Hf much more unstable than those in Figure 1; in Figure 2b returns are 


Figure 1. Returns on A. T. & Sfe Common (1937-54), A = average: St. Dv. = 
standard deviation; * = annual return. 


In Figure | the lowest return is approximately two standard deviations 
below the average: the asterisk farthest to the left is roughly in the 
neighborhood of the arrow labeled A — 2 St. Dv. The highest return, ee ay 
on the other hand, is approximately two standard deviations above the We 0-20-10 0 
average: the asterisk farthest to the right is in the neighborhood of the 
arrow labeled A + 2St. Dv, This relationship between the average, the 
standard deviation, the highest return, and the lowest return of a series is Figure 2. Returns on two hypothetical securities. 


not unusual, although it is far from exact. Table 3 compares A — 2 St. 


Dy. and A + 2 St. Dv. with the lowest and highest returns of the nine Wore {requently near their high or low extreme. The standard deviation 
securities used in Chapter I. In some cases the highest and lowest returns ‘W felurn for the security in 2b is substantially higher than that in Figure 1. 
Heiee A + 2 St. Dv. is substantially higher than the highest return. The 
Taste 3 ‘elurns in Figure 2a, on the other hand, are more stable and the standard 
COMPARISON BETWEEN AVERAGE RETURN, STANDARD Wriition is lower than that in Figure 1. Hence, for the distribution in 
DEVIATION, AND HIGHEST AND LOWEST RETURNS. Rips da, A + 2 St. Dv. is substantially less than the highest return. 
¢ see that the statement “the highest and lowest returns in a series are, 
, 2 Je often, approximately two standard deviations removed from the 
aaah vies | oe Aes | ep sa| ely po a ithe sais areran extremely rough rule of thumb. It does, 
Wowever, provide a start toward an intuitive feeling for the notion of a 
1 ier al 28 : ‘Wanard deviation. 
2 962 121 04 “ 
3 146 292 7390 ‘The variance and standard deviation of a random variable are defined 
4 173 309 791 3 ‘Huslogously to those of a past series, as may be illustrated in terms of the 
5 198 357 912 4 ‘eilom variable on the outer ring of the wheel in Figure 3. The random 
s 055) 203 40) ‘eriable on the second ring equals the random variable on the first ring 
i ZET n168, es inuis the expected value of the variable on the first ring. The random 
: Ora acs os ‘ ‘Wilable on the third ring is the square of the random variable on the 


Wend ting, The expected value of the third ring is the variance of the 
Hilom variable on the first ring. 
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The variance of a random variable, then, is the expected value of the 
squared deviation from the expected value. 

The variance of a random variable can be computed from a table of 
probabilities. The entries in the second column of Table 2 can be inter- 
preted as the probabilities associated with the random variable on the 
outer ring of the wheel in Figure 3. The variance of the random variable 
is computed from these probabilities exactly as the variance of return of 
the past series was computed from its table of relative frequencies. 


Figure 3, The variance of a random variable. The outer ring generates the random 
variable r. The second ring generates the random variable r, equal to r minus the: 


expected value of r. The third ring generates (7’)8. The expected value of (r’)*is the 
variance of r. 


The standard deviation of a random variable is the square root of its 
variance. The standard deviation of a random variable measures how 
close the random variable is likely to be to its expected value, 


The variance of an uncertain future event is defined in terms of proba-_ 
bility beliefs exactly as variance was defined, in terms of objective 
probabilities, for a random variable. If the entries in the second column 
of Table 2 represented probability beliefs, then .0034 would be the variance 
based on these probabilities, As before, standard deviation is the squ 
root of variance. Standard deviation, in this case, measures the degree of 
uncertainty associated with the future event. 


In Chapter XIII the standard deviation is compared with other measures 
of risk and variability. For most of the measures considered, the efficient 
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portfolios produced by using standard deviation are to be preferred. 
‘Some measures which seem reasonable offhand produce completely 
Wsatisfactory portfolios, One of the measures considered, the semi- 
deviation, produces efficient portfolios somewhat preferable to those of the 
Hundard deviation. Those produced by the stanJard deviation are 
Wiisfactory, however, and the standard deviation itself is easier to use, 
More familiar to many, and perhaps easier to interpret than the semi- 
Weviation, 

The semi-deviation is the topic of Chapter IX. The discussion in that 
shupter draws heavily on results derived in the present and intervening 
*hapters concerning the standard deviation. 

The primary concern of the present chapter is with the characteristics 
Wf securities which determine the standard deviation of a portfolio. A 
‘Wiies of intermediate results will be presented which build up to the general 
WMiationship between security and portfolio. 


Notation 


Home conventions concerning notation will be useful in subsequent 
Wetions of this chapter. 

‘Op the outer ring of the wheel in Figure 3 we have a random variable 
hich we might refer to as r. On the second ring of the same wheel we 
fave a random variable equal tor minus the expected value of r. We shall 
ffey to the second random variable as r’. In general, if we have a random 
Wiuble p or q or r, we shall refer to the difference between the random 
‘alluble and its expected value as p’ or q’ or r’, respectively, The third 
Yip on the wheel in Figure 3 generates a random variable equal to the 
Hjuare ofr’. This will be referred to as the random variable (r')*. By 
Welinition, the variance of r is equal to the expected value of (r’)®, Simi- 
Wily, the variance of p or q is the expected value of (p')? or (q')2, respec- 
lively. 

The following are two important facts about any random variable r' 
1H p! org’): 


{}) the expected value of r’ equals zero; and 
(2) the variance of r' equals the variance of r. 


Paoor oF (1): r’ equals r minus the expected value ofr. This may be thought 
4 the sum of two random variables, namely, r plus another random variable 
‘ehiel always equals —(the expected value of r). Applying our result on the 
fn of two random variables, we find that the expected value of r’ equals the 


‘}peeted value of r minus the expected value of r, which equals zero, 


Veoor oF (2): The variance of r’ is, by definition of variance, equal to the 
Hipwrted value of (r' minus the expected value of r’)', But the expected value 
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of r' is zero; so the variance of r’ is the expected value of (r')?. But this is also 
the variance of r. 

Concepts such as “the expected value of r” and “the variance of r” 
will occur repeatedly. It will be convenient to let “expt” stand for the 
phrase “expected value of” and let “var" stand for the phrase “the 
variance of.” For example, 


expt(r) refers to the expected value of r, 
var(r) refers to the variance of r, 

expt(r’)? refers to the expected value of (r')?, 
var(p +) refers to the variance of p + q. 


The reader should have lite difficulty in remembering the definitions of 
expt and var. Later it will be necessary to adopt a more compact but 
less mnemonic notation. 

With our present notation the discussion concerning r’ can be sum- 
marized as follows: 


r= r—expt(r), 
expt(’) = 0, 
var(r’) = var(r). 
There are a number of conventions in use for representing the phrase 
“A multiplied by B.” These include: 
A times B, 
AXxB, 
A+ B, and simply 
AB. 
The notation A x B is rarely used in mathematical literature, since tl 
symbol x frequently has other uses. We shall never use A x B 
fepresent A multiplied by B. Thus far we have most frequently used the 
phrase A times B. It will be increasingly desirable to use the moi 
compact notations A- B and AB. The latter, being the more compact, 


is generally preferred. The dot (:), however, is sometimes used to avoid 
ambiguity. For example, 


expt(Ag + Br) = A- expt(g) + B+ expt(r) 
is clearer than 


expt(Ag + Br) = A exptig) + B expt(s). 
A CONSTANT Times A RANDOM VARIABLE 


The outer ring of the wheel in Figure 4 genetraes a random variable /, 
The second ring of the wheel generates a random variable w which 
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Whevitably 3 times as large as r, w may also be referred to as 3r, The 
IWird ring on the wheel generates the random variable w’ = w — expt(w); 


4. The variance of Jr. The first ring generates a random variable r. The 
‘Srond ring generates a random variable w= 37, The third ring generates w’ = w 
= tapi). The fourth ring generates (w’)8. Expt(m')? equals var(3r). 


for example, w = .30 is .27 greater than the expected value of w. The 
fourth ring generates the random variable (w’)?. The expected value of 
the tandom variable on the fourth ring is the variance of w: 


var(w) = expt(w’)?. 


I the last chapter we studied the relationship between the expected 
Ylvie of r and the expected value of w. We found that, for w = 3r, 
fpllw) = Jexpt(r). In general, we found that 


expt(Ar) = A+ expt(r). 


In the present section we shall study the relationships between the 
Handard deviations and variances of w and r. We shall find: 


| Iw = Ar, then the standard deviation of w equals A : (the standard 
Heviation of r). 


2 If w = Ar, then the variance of w equals A? (the variance of r); 
Wat is, varQw) = A? var(r). 


Tor cxample, if the standard deviation of return from $1 bet on the spin 
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of a wheel is .2, then the standard deviation from a $5 bet is (5)(.2) = 1. 
The variance of the $1 bet must be (.2)* = .04. The variance of the $5 
bet may be calculated in two ways. One way, we can apply the formula 
in 2, above, to find that the variance of the $5 bet is (5)*.04) = (25.04) 
= |. The other way, we can use the fact that the standard deviation of 
the §5 bet has already been determined to be 1; hence the variance of the 
$5 bet must be (1)*= 1. Properly, both methods produce the same 
result. 

It is easiest first to verify the relationship between the variances of w 


and r. The relationship among standard deviations then follows as a 
corollary. 
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ul 
Vvar(w) = the standard deviation of », 


Vvar(r) = the standard deviation of r, 
from which the corollary follows immediately. 


The theorem and corollary stated and proved above for a random 
‘ariable can be rephrased to apply to past series and probability beliefs: 


| If-series | is always A times as great as series 2, then the standard 
deviation of series | is A times as great as that of series 2; whereas the 
‘Wtlance of series 2 is A? times as great as the variance of series 1. 

? Ifrisan uncertain future outcome about which an individual holds 
# Consistent set of probability beliefs, and if w is an outcome which the 
individual believes must inevitably be A times as great as r, then the stan- 
dard deviation of w is A times as great as 7; whereas the variance of w 
i) At times as great as that of r. 


Theorem: Ifr is a random variable generated by any wheel, A is any number, 
and w is the random variable which inevitably equals 4 r, then 
var(w) = At var(e). 


Proor: We start by showing that w’ = Ar’. By definition, 


ww" = w — expl(w), 


‘The proof for the case of random variable can be used, practically word 
But w = Ar and expt(w) = A expt(r), $0 


toy word, as a proof for the case of an uncertain event subject to a con- 
Wilent set of probability beliefs. The proof for the case of a random 
variable becomes a proof for the case of a past series with little more than 
the substitution of the word “‘average” for the phrase “expected value.” 


wi = Ar— A> exptir). 
But Ar — A expt(r) = Alr — expt(r)], so 
w! = Aly — expt(r)). 
Ii makes sense to speak of “expected return per dollar invested” or 
“everage return per dollar invested” or “‘standard deviation of return per 
Wollar invested." But it does not make sense to speak of “variance of 
feturn per dollar invested.” If the expected return from $1 bet on the 


spin of a wheel is 01, then the expected return from $10 is .1, the expected 
feivin from $100 is 1,0, and so on, In all cases the ratio 


Since r — expt(r) is what we refer to as r', we have 
w' = Ar’, as we sought to show. 
If we square both sides of this equation, we find that 
(wi = AK 


This says that the random variable (w’)* is always equal to (is the same thing as) 
the random variable 4*-(r')?. Hence 


expected total return 
amount bet 


i the same, .01. Similarly, the ratio 


expt(w’)? = expt(4?r’?), 
or, equivalently, y 
expt(w')®? = A? expt(r’)*, 


But expt(w')? is the variance of w, and expt(r')? is the variance of r; therefore standard deviation of total return 
var(w) = AP: var(r). QE, amount bet 
Conoitany: If r is a random variable generated by any wheel, A is 
number, and w is the random variable which is inevitably 4 « r, then the stan 
deviation of w is A times the standard devi 


femeins constant as the size of bet varies, This is not true of variance. 
If the variance of a $1 bet is, say, .001, then the variance of a $10 bet is -1, 
tie variance of a $100 bet is 10, the variance of a $1000 bet is 1000. 
Depending on the amount bet, the ratio 


Proor: The theorem on variances states that 


var(w) = A? var(r). 


variance of total return 
Hence a ae ee 


Vvarw) = A+ Vvartr). amount bet 


Way be 001, .01, .1, or 1. 
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Itis standard deviation rather than variance which is directly comparable 
with a past series or a set of probabilities. The high and low returns of 
the series in Figure 1 are about two standard deviations removed from the 
average of the series. If we change the unit of measurement from dollars 
to pennies (if we replace .5 by 50 on our graph), the high and low returns 
femain about two standard deviations removed from the average. The 
variance, however, goes from being smaller than the standard deviation 
(.127 vs. .357) to being larger than the standard deviation (1270 vs. 35.7). 

It is frequently more convenient to work with variance than with 
standard deviation. Relationships between securities and portfolios are 
generally simpler when expressed in terms of variances rather than standard 
deviations. The variance is less clumsy to deal with in most proofs. In 
computing efficient portfolios variance is used until the last, when variance 
is translated into standard deviation for the purpose of presentation. Of 
course, any relationship expressed in terms of variances can be translated 
into terms of standard deviation by substituting (standard deviation)? for 
variance. We shall frequently express and prove relationships in terms 
of variances, leaving the translation to standard deviations for the reader. 


COVARIANCE AND CORRELATION, 


The definition of the covariance between two random variables is 
illustrated by the wheel in Figure 5. The outer ring of the wheel generates 
a random variable g. The second ring generates a random variable r. 
The third ring generates the random variable ’ equal to q — expt(q). 
The fourth ring generates r’ equal to r — expt(r). The entries on the fifth 
ting equal the product of the entries on the third and fourth rings. The 
fifth ring, in other words, generates the random variable q’r'. The 
covariance between 9 and r is defined to be the expected value of the 
fifth ring. 

The covariance between q and r, then, is the expected value of q'r’. 


It will be convenient to let cov(g, r) represent the phrase “the covariant 
between g andr." Thus 


cov(g, r) = expi(r'g’). 


A more compact notation for covariance will be adopted later, whet 
needed 

Covariance can also be defined in terms of a table of probabilities. 
Table 4, for example, values of g and q' are written across (he top; values 
of rand r’ are written along the side. The entries of the table indicate t 
Probabilities of combinations of g and r. Thus the entry at the inte 
section of the row with r = .04 and column with g = .05 indicates thal 
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{his combination of g and r has one chance in six of being the result of the 
spin of the wheel in Figure $. To compute covariance from such a table, 


Fiquie 5, The covariance of gand.r. The outer band generates the random variable g. 
Th second band generates the random variable r. The third band generates q’ = 
J tpl). The fourth band generates r’ =r — exptir). The fifth band generates 
4) The expected value of gr’ is the covariance between g and r. 


Wwlliply r’ times q’ times the probability of the particular combination of 
yond rand sum all such products. For Table 4, for example, 


©ov(q, r) = (1/6 —.035)(-.075) + (1/6) —.065) —.075) 
+ (1/6)(.015)* + (1/3.035)(.065) + (1/6)1.055)(—.035) 
= .001725. 


The covariance between two past series is defined analogously to the 
Syyariance between two random variables. If the entries listed around 
the first and second rings of the wheel in Figure 5 were past returns on two 
‘eurities, then the covariance between these two series would be the average 
#f the entries on the fifth ring of the wheel. If the entries in Table 4 
‘presented relative frequencies rather than probabilities, the covariance 
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would still be computed as above, The same would also be true if the 
entries of the table represented probability beliefs. 


Ht Ag minus expi(4g) = Alg — expt(g)] = Aq’; similarly, Br — expt(Br) = 
*\ hence 
cov(g, Br) = expt(4q' Br’) 
= AB expi(q'r’) 
= AB cov(g, 1). QED. 


Taste 4 


TABLE OF PROBABILITIES is 
Ninilar theorems and proofs apply to past series and outcomes subject to 


Hrobability beliefs. 
q 
| a | : %, The coefficient of correlation for two random variables q and r is defined 
0: ‘ 
+ the ratio 
r ’ covariance between q and r 
q (standard deviation of g) times (standard deviation of r)_ 
LY 5 ‘The same definition applies to the correlation between two series or two 
075 035 O15 065 PP n 
Wheertain future events. As the reader will prove for himself later 
{Paercise 4 at the end of the chapter), the correlation coefficient is always 
—.05 —.075 16 Wiween —1 and +1 inclusive. 
se ‘i Ha 6 16 If q is always an exact positive multiple of r, then their correlation 
6 ‘035 13 preiiicient is +1. If g is an exact negative multiple of r, so that q is 
ce ‘055. V6 Jpwest when ris highest, then their correlation is —1. If g and r tend to 
‘ove up and down together, but not in perfect unison, then their correla- 


{WN coefficient is greater than zero but less than +1. If g and r move 
Wilependently of each other, then their correlation coefficient is zero, 

When two variables have a zero correlation coefficient, they are said to 
W ineorrelated? 


The covariance is a measure of the extent to which two sets of numbel 
tend to move up and down together. When both g and rare above the 
means, g’+r’ is positive; when both g and r are below their meat 
q’ ‘1’ is also positive; when one variable is above and the other below i 
mean, q’ +r’ is negative, Thus, if is usually above its mean when r 
above its mean, and below its mean when r is below its mean, then tl 
covariance of q and r is positive. If'g is usually above its mean when r 
below its mean, and vice versa, then their covariance is negative. If t 
‘outcome of g has no influence on the outcome of r—like random varial 
generated by two independent wheels—then their covariance will 
zero, 

‘The following theorem concerning covariances will be needed later: 


/ {wo variables which are uncorrelated are not necessarily independent, If two 
‘W/iibles are independent, the value of the outcome of one variable has no influence on 
‘WW ovicome of the other, as is true with the random variables whose probability 
PHPbulion is given in Table (a). It is possible for expt(q’r’) 1o be zero, hence for their 

elation to be zero, even if q and r are not independent, This is illustrated by 

‘hie (b), Independence always implies zero correlation, but zero correlation does 
Y tiways imply independence. 


Taste (a) Taste (b) 


‘Tueorem: Ifg andr are any pair of random variables generated by any wi 
and if A and B are any two numbers, then the covariance between the rani 


variables Ag and Br equals A times B times the covariance between q and | ° | $41 = | 0 | Pr 
that is, 
(Aq, Br) = AB- cov(q,r). 
ort $i ) Deitel | Ve 116 =! V8 8 
PROOF: cov(4g, Br) equals the expected value of | 4 | 18 ° 1p 
1116 | 8 | 116 +1] 18 1/8 


[4g — expt(4g)] times [Br — expt(Br)]. 
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‘The correlation coefficient provides a more easily interpreted measure, 
than does the covariance, of the extent to which two variables tend 
to move together. The covariance, however, proves to be more con- 
venient in formulae and proofs. Results expressed in terms of covari- 
‘ances can be translated into terms of correlation coefficients by using the 


relationship 


(covariance of g and r) equals 

[(the correlation coefficient of g and r) times 
(the standard deviation of q) times 

(the standard deviation of r)]. 


VARIANCE OF A SUM OF Two RANDOM VARIABLES 


The first and second rings of the wheel in Figure 6 generate rand 
variables g and r, respectively. The third ring generates the rand. 


Figure 6. The random variables q, 7. 54's 1's 5" 


variables = +r. Theentry on the third ring is, in every case, the 
of the entries on the first and second rings. The fourth, fifth, and 
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‘ings generate, respectively, the random variables q’, r’, and s'; where 

7 = 4 — expt), 

rv =r —expt(r), and 

s = 5 — expt(s). 


In the last chapter we studied the relationship between the expected 
velues of g, r, and s. We found that expt(s) = expt(g) + expt(r). In the 
nt section we shall study the relationships between the variance of s 


‘wid the properties of g and r. 
Let vs first note that, as illustrated on the wheel in Figure 6, 


sad tr. 
‘Thal this is true in general may be shown as follows: By definition, 
= (g +r) — expt(g +7). 
‘Applying the theorem on the expected value of a sum, we get 


Ss =q +r— expt(q) — expt(r). 
Hearranging terms, ae ri 


=q — expt(g) + r — expt(r). 
Wy = expt(g) = 9’ and r ~ expt(r) =r’, so 


s' =q' +1, as we sought to show. 


own in high school algebra texts that, if A and B are any two 
hers, 
(A + BY = A + BY + 2B. 


is, the square of (the sum of A and 2) equals (the square of A) plus 
tquare of B) plus 2 times A times B. For example, if A = 3 and 
A then 

G+ 4) = 7 = 49 and 
PHP $2-3:°4=94 16 + 24 = 49, 


ling that the number resulting from the operations (A + B)* is the 
‘#) the number resulting from the operations A? + B? + 2AB. 
‘Weare now ready to state and prove the theorem concerning the variance 
‘Whe tum of two random variables. 


Joni: If q and + are any pair of random variables generated by a 
| sid sis the random variable which always equals Bes q fe 
ince of g plus the variance of r plus twice the 


var(s) = var(g) + var(r) + 2covig, r). 
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Proor: The variance of s equals the expected value of (s')', But s’ equals 
g’ +1r',s0(s'* = (q' + r')*, which is the same thing as (q'? + (r'F + 2g'r’, 
Hence 

(SF =P + GP + gr, 
from which it follows that 
expt(s')? = expt(r’)? + expt(g')* + 2 expt(q’r’). 
But expt(s’)? = var(s), expt(r')? = var(r), expt(q’)* = var(g), and expt(q'r') = 
cov(g,r). Therefore 
var(s) = var(r) + var(q) + 2cov(g, r).. QED, 


Corottany: If g and r are uncorrelated—if their correlation coefficient and 
(therefore) their covariance equal zero—then the variance of s equals the varia 
of g plus the variance of r; that is, when cov(q, r) = 0, then 


vat(s) = var(q) + var(r). 


Exame es: If the variances of both g and r equal 1, and the two variables 
uncorrelated, then the variance of (q + r) equals 1 + 1 = 2. The stand: 
deviation of g + r equals W2, or about 1.41, Compare the above with the 
in which the two variances still equal 1, but their covariance equals 1/2. TI 
variance of q +r then equals 1 + 1+2-1/2=3. The standard deviation 
of g +1 in this case equals V3, or about 1.73. Other things being equal, t! 
higher the correlation between two variables, the greater is the variance of thei 
sum. 

If the variances of the two variables equal 1 but their covariance equal 
—1/2, then the variance of the sum equals | + 1 —2:1/2= 1. Negati 
covariances reduce the variance of the sum, whereas positive covariances incre 
the variance of the sum 

For an example with variance not equal to 1: if var(q 
covig, r) = 2, then 


4, var (r) = 9, al 
varig+r)=44+942-2=17. 
The standard deviation of g + r equals V17, or approximately 4.12. 


VARIANCE OF A WEIGHTED SUM OF TWO VARIABLES 


In the last chapter we studied the expected values of weighted sums) 
w= Ag+ Br. We found that expt(w) = A -expt(g) + B- expt(r). 
the present section we study the relationship between the variance of 
and the variances and covariance of q and r. 

‘Tweorem: Suppose that g and r are any pair of random variables general 
by any wheel, that 4 and B are any two numbers, that w is the random varial 
which always equals 4q + Br; then 

var(w) = A?- var(g) + BY vai(r) + 24B- cov(g.r). 

PROOF: According to the theorem on the sum of two random variables, 

var(Ag + Br) = var(Aq) + var(Br) + 2 cov(4g, Br). 
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Hi in previous sections we showed that var(4g) = A? var(q), var(Br) = 
WH vary), and cov(Ag, Br) = AB covig,r). Therefore 

var(Ag + Br) = A* var(q) + BY var(r) + 2AB- cov(g, r). 

As usual, the theorem applies to past series and probability beliefs: 

1, If series 1 is a weighted sum of series 2 and 3, with A and Bas weights, 
then the variance of series 1 equals A* times the variance of series 2 plus 
‘# times series 3 plus 248 times the covariance between series 2 and 
Mties 3. 

2 Ifa future outcome i, subject to a consistent set of probability 
beliefs, must inevitably be a weighted sum, 4g + Br, of two future events, 
then 
var(w) = A var(g) + B yar(r) + 2ABcov(q, r). 


‘Tit VARIANCE OF A SUM OF THREE RANDOM VARIABLES 


In this section we study the variance of a random variable s equal to the 
$i of three other random variables p,q, and r. To start with we show, 
‘Peering the random variable s’ = s — expt(s), that 

Arps 
Wy definition, 
S=(ptqtr)—exp(pt+ 9 tr. 
Wee expt(p + q +r) = expt(p) + exptig) + exptir), we have (after 
Harranging terms) 
5! =p — expt(p) + q — expt(g) + + — expt(r) 
Wi, since p’ = p — expt(p), etc., we arrive at the sought-for relationship: 
S=aptgtr. 
‘To discuss the variance of three random variables we shall need some 


_ for the square of the sum of three numbers A+ B+ C. One 
‘ula for the square of the sum of three numbers is 


(A+ B+ CP =A? + BY+ C+ 24B + 2AC + 2BC. 
‘} weond formula, closely related to the first, is 


(A+ B+ C= A-A + AB+ AC + BA + Bs B+ BC 
+CA+CB+C-C. 
‘iy ‘ender may find Figure 7 helpful for remembering the latter formula. 


| Hand Care written along the top and the side of the square in Figure 7. 
hy products of the numbers at the side and top are written in appropriate 


a 
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‘Theorem: If p,q, and r are any triplet of random variables generated by any 
hel, if A, B, and C are any three numbers, and if w is the random variable 
Wet Ba + Cr, then 

var(w) = A®- var(p) + AB: covip,q) + AC > cov(p, r) 
+ AB» cov(q,p) + Bt: var(9) + BC: cov(g, r) 
+ CA-cov(r, p) + CB cov(r, gq) + C?> var(r) 


places within the square. (A + B + C)?is the sum of all these products. 
The first version of (A + B + C)?is obtained from the second by making 
replacements such as A? for A~ A and 2AB for AB + BA. 

Tueorem: If p,g, and r are a triplet of random variables generated by any 
wheel, and if sis the random variable p + q + r, then 

var(s) = var(p) + cov(p,q) + cov(p,r) + cov(g, p) + var(q) 


HF, equivalently, 
+ covlg, r) + covlr, p) + covlr, q) + var(r), 


yatlw) = AP vartp) + BY var(g) + C?- var(r) 
+ 2AB -cov(p,q) + 2AC- cov'p,r) + 2BC cov(g,r). 


Paoor: According to the result concerning the variance of a sum of three 
Teom variables, 


vail dp + Bg + Cr) = var(Ap) + var(Bq) + var(Cr) 
+ Zcov(Ap, Bq) + 2 cov(Ap, Cr) + 2cov(Bg, Cr). 
ti var(4p) = A® var(p), etc., cov(4p, Bq)'= AB cov(p, 9), etc.—from which 


or, equivalently, 
var(s) = var(p) + var(g) + varir) + 2cov(p.q) + 2cov(p, r) + 2 cov(q, r). 


The second version of var(s) follows immediately from the first if we mal 
replacements such as 2cov(p,q) for covip,g) + cov(g, p). The first version 
of var(s) is similar to the square in Figure 7, and, in fact, is derived from it. 


ilaws the second version of var(w). The first version follows from the second 

feplacements such as cov(p, g) + cov(q, p) for 2 cov(p, 9). 

Avcording to the first version of var(w) in the above theorem, the 
‘efinnce of w-is the sum of all covariances—counting cov(p, q) and cov(g, p) 
» ilifferent covariances, and also counting var (p)as a coviarance, cov(p, p). 
Aveording to the second version, var(w) is the sum of all variances plus 
tyie the sum of all distinct covariances—not counting cov(g, p) as 
Winet from cov(p, g). When the variables p, g, and r are uncorrelated, 


var(w) = A? var(p) + B® var(g) + C*var(r). 


Bit Ba + BB +B 


Cleca 40-8 40-6 


The theorem on the variance of a weighted sum of random variables can 
be rephrased to apply to the weighted sum of past series or future 
Wain events. 


Figure 7. The square of a sum (A + B + C)tis the sum of all products within 
square, 


The theorem concerning var(s) may be proved as follows: 


var(s) = expt(s’)® = expt(p’ + 9 + r')*. Noration FOR LARGE NUMBERS OF RANDOM VARIABLES 


But this in turn equals the expected value of 
ON +g ter +a +a +f +r +r +) 


The expected value of this sum is the sum of the individual expected val 
But, since expt(p'}' = var(p), expt(p’ ¢') = cov(p, q), etc., we have the desi 
result: 


hull revert to the notation used at the end of the last chapter for 
ing With large numbers of random variables. We shall refer tor, as a 
fandom variable, rz as a second random variable, rs, as a fifty-second 
om variable, and ry as the last of N random variables. We shall 
WH 10 A, as 2 first number, A, as a second number, Ay as a sixty-fourth 
‘Pyitber, and Ay as the last of N numbers. We shall let rj =r, — 
1), rh = "2 — expt(r,), and so on. 

| will also be convenient to use a more compact notation for variances 
fovariances. We shall let V, represent the variance of r,, V, represent 
yatiance of r,, Vy represent the variance of ry. 

‘Ty fepresent the covariance between r, and r, we shall use the symbol 


var(s) = var(p) + cov(p, g) + cov(p,r) + cov(g, p), etc. 


VARIANCE OF A WEIGHTED SUM OF THREE RANDOM VARIABLES 


‘The theorem for the weighted sum of three random variables foll 
casily from the theorem on the ordinary sum of three random variable 
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Twworem: Suppose that ry ry.rs°°°,ry are any N random variables 
jated by some wheel, suppose that s is the sum of these random variables; 
fi the variance of s is 
vat (3) = 01, + Oie + O19 + + Oy 
$0n + Ont On t's + Oy 
+0 + On + Oy + °° + Hy 


6. @ is the small Greek letter sigma. 04, may be read either as “the 
covariance between r, and 7,” or simply as “‘sigma 1,2.” The covariances 
between r, and ry, or r, and rs, or rz and ry are similarly represented by 
243, O25, and ogy, respectively. 

Since the variance of r, is the same as cov(r,, r,), we could represent this 
variance by o,, as well as by V;. 


VARIANCE OF A SUM OF ANY NuMBER OF RANDOM VARIABLES 


toy toy, toys ts teyyy 


H, Muivalently, 
var(s) = Vit Vat Vat >> + Vy + Qty + ig t+ > + ny 
+ Qoag +> + + Igy, ete. 


Suppose that s is the sum of V random variables: 
Sentntnyt tin 


then 


Ay Ag Aye 


Sentnytiynte ti, 


A\A, +AyAr +A\Ay + 


This may be shown as follows: 


Sa(ntratiyts try) expt, treet ts try) JtAgA, +AgAy 


+ApAy + 


Applying the theorem on the expected value of a sum, and rearrangin 
terms, we obtain 


cS 


Iy|tAsA, +AgAp +AgAy to. oo tAgAn 


Sf = ry — explt(ry) +r, — expt(r,) ++ + ry — expt(ry), 
from which follows 
Sarntihtec ti. 

To study the variance of the sum of any number of random variables 
need to know the general formula for the square of a sum. One versi 
of the square of the sum of N numbers is 
(Ay + 4g + Ay te + AN) = ALA, + AVAn + Aa t+ Ay 
+ Agd, + Agdy + Ardy t+: 1 + Apdy 


Aj} Awdy +4aAp sacsttiscat had 


is similar to that employed previously for 
fm of three variables. The variance of 5 is the expt(s’}t. But (6) = 
Vib r+ + trp Expanding this sum of a square according to 
formula presented in Figure 8, taking the expectation of the resulting 

lon, making substitutions such as o,, for expttr,r,) and ox for expt(rir.), 
ive at the first formula of the theorem. The second version of var(s) 
i) bw obtained from the first by substitutions such as V; for o,, and 2c,, 
aah on 


This may be thought of as the sum of all entries in the square in Figure 

A second version, obtained by making replacements such as 47 for Ay 

and 24,4, for A,Ay + 4eAj, is: 

(A, + Ap + Ay +° + Ay)* 
= (A)? + (Ad + (Ad)? + + (Anh 

+ 2A\Ay + 2AAy + °° + + 2A AW 

+ 2Agds + °° + 2AsAN, 


Hf sil the random variables are uncorrelated, the theorem asserts that the 
‘ihe of the sum is the sum of the variances: 


Vs) = Vy + Vat Vote + Vy. 
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If all variables are positively correlated, the variance of the sum is greater 
vely correlated, 


than the sum of the variances. If all variables are nej 
the variance of the sum is less than the sum of the variances. 


VARIANCE OF A WEIGHTED SUM OF ANY NUMBER OF VARIABLES 


‘Tueorem: Suppose that ry, yrs," ° ‘sry are any N variables generated 
some wheel, that 4, 
variable Ayr, + Avy 4++ + Ayry. The variance of w is equal to 

var(w) = A,Ayo,, + AAs + Aga +0 + AAS 
+ AAyn + ArAates + ArAates + * + + ArAGry 
+ Aya, + AyAaOay + Assay + °° + AsA yoy 


+ AyAyoys + AyAro ye + AyAsoys ++ «+ AyANeyye 
or, equivalently, 
var(w) = ATV, + AEVa + AVs 4° °° + AR Vy 
$ TAA + 24,AgO,y +> + + 2ALAYy 
+ AAG + °° + 2AAyORy, ete. 
PROOF: The proof of this theorem is essentially the same as that invol 
the weighted sum of three random variables, The first version of the 
for the ordinary sum of WV variables tells us that 
var(Ayr + Aga +++ + Ayry) = COMAGM. Aura) + COMA. Arts) 
+ COMA. Aats) +2 + + COMA, Ayry) + COV(Adr, Ayr) 
+ cov(Arrs, Arr), etc. 
Making substitutions such as 4,401, for cov(Ayry, Ayr) and A,4z0.2 
cov(Ayr,, Agfa), We arrive at the first version of var(w) in the theorem. 
second version follows from substitutions such as ¥; for oy, and 24,01, 
AA + ArAyon- 


APPLICATION TO PasT SERIES AND PROBABILITY BELIEFS 


The theorem on the variance of a weighted sum of any number 
random variables also applies to weighted sums of past series or uncer 
future events. 

Table 5 illustrates the computation of the variance of a portfolio 
sisting of four securities. The securities involved are numbers one thro\ 
four from the analysis of the past series of nine securities presented 
Chapter If. Column (1) of the table presents the amounts invested in 
of these securities, per dollar of portfolio, for the portfolio whose vari 
is to be evaluated. Thus the portfolio consists of 10% security 1, 
security 2, 30% security 3, and 20% security 4. Column (2) 


‘Ay are any N numbers, that w is the random. 


TasLe 5 
VARIANCE OF A PORTFOLIO 


STANDARD DEVIATIONS AND VARIANCES 


@) 
Amount 


@) 


mM 
Amount | Covar. 


(6) 
Covar. 


@) 
Amount 


® 


Covar. 


@) 
Amount 
times 


Q) 
Cov. 


Covar. 


ay 


Amount 
Invested 


with 


Sec. 2 


with 


Sec. 1 


times 


with 
Sec. 4 Cov. 


times 


Cov. 


with 


Sec. 3 


times 
Cov. 


021 0021 029 


0053 


053 


(6) Amount i 
(7) Amount * Sum 
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covariances between security I and the securities of the analysis. If we 
let o,, represent the variance of security 1, 01, represent the covariance 
between security I and security 2, etc., column (2) tells us that 0), = .053, 
42 = 021, a3 = .029, 044 = .049. Column (4) similarly presents on, 
24, %3, Oy4; Column (6) presents oy, Og, Ogg, Fax Column (8) presents: 
Gey Gm Cex: az Column (3) presents the amounts invested in cach 
security multiplied by the covariance of the security with security 1. Thus 
(the amount invested in security 1) times (the variance of security 1) 
.0053; (the amount invested in security 2) times (the covariance betwe 
the second security and the first security) is 0084. If we let Ay rep 
the amount invested in the first security, A, the amount invested in tl 
second security; yin the third; A, in the fourth, then the sum of columi 
(3), .0322, equals 
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Ay, + Ag: + Astis + Ace 


If we multiply this sum by Ay, the amount invested in security 1, we 
.00322, which equals 


AyAyOy, + ArAaon + ArAa%3 + AA 
Similarly, the first four entries in column (5) give us A,9q1, As0s2» As 
Ay. The fifth entry gives us 

Ayn, + AgOzn + Asa + AsO 
The seventh entry of column (5) gives 
yA, + pda + 424s + Arm 
If we add together the four entries in row 7, we obtain 
AAyoy + ArAatye + Aras + AA 

+ ApAion + Aarne + 4243005 + Arde 

+ AsAyon + 4341032 + A4s0m + Arden 

FAA + AgAr%e2 + Aadaer + AAU 
That is, we obtain the variance of the portfolio, .03576. 


DertveD COVARIANCES 


To find portfolios with minimum standard deviation for various lev 
expected return, an analysis must start with expected returns for 
security, variances of return for each security, and covariances of 
for each pair of securities. An analysis of 100 securities requires | 
expected returns, 100 variances, and almost 5000 covariances. 
reasonable to ask security analysts to summarize their researches in | 
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‘erelully considered expected returns, and 100 carefully considered 
Wriances of return. It is not reasonable, however, to ask for 5000 care- 
Wily and individually considered covariances. Yet covariances are 
‘ential to an analysis of efficient portfolios. They express relationships 
. Securities; they guide the analysis to properly diversified port- 

los, 

The thousands of covariances needed for large security analyses make 
Hf eiential that 


{}) the relationships between securities be portrayed in a manner less 
Pyiibersome than individual covariances, and 

(2) the required covariances be derived from this portrayal by means of 
# electronic computer. 


‘Tie nature of such a portrayal, and the manner by which covariances can 
Wy deduced from it, may be made clear by example. 
A (hypothetical) staff of security analysts, together with a (hypothetical) 
id analyst, have decided that the relationships between the securities 


thelr analysis can be represented by the following three types of 
wlormation. 


| Figure 9 represents their consensus as to the probabilities of various 
, changes in a certain index during the coming 12 months. They 
| for example, that there are 10 chances in 50 that there will be about a 


es eR oe rane nee ees 
sw 0-0 -30 -20 =10 o 2 0 8 0 Oo MH hw HW 
A vecrease % increase 


Figure 9. Hypothetical probability beliefs concerning changes in an index. 


\nerease in the index, 1 chance in 50 that the index will fall by 50%, 
Hianees in SO that the index will stay approximately the same. 

} Vigures 10a and 10b represent the analyst team’s consensus con- 
Wig the relationship between the change in the index and the expected 
1// (capital gains plus dividends) on two securities. ‘Thus, if the index 
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does not change, they expect the return on security (a) to be 10%, while 
that on security (b) is expected to be 0%. If the index increases by 40%, 
the return on security (a) is expected to be 309%, while that on security (b) 
is expected to be 80% The analyst team has developed similar relation- 
ships between security return and change in index for each of the othe 


securities in the analysis, 


s 
return 


on 
security (a) 


ss 8 8 


-10 -8 -0 - -2 
t 1 


5 Sa alot ey 
nwo © © 1m 1 10 
# change in 
index 
ob 
(a) 
* wu 
return 
on oF 
secunty (b) 
wh 
n 
(ese ee SF pee ye ey 
-10 0 0 -0 -2 oo 0 % 1 1 MO 
-0 % change in 
index 


Figure 10. Relationships between index and securities. 


3. It is not certain that the return on each security will agree with 
expected relationship between security and index shown in Figure 
Figure 11 shows their consensus as to amounts by which the returns 
securities (a) and (b) are likely to deviate from the relationship in Figu 
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‘Ths itis felt that there is | chance in 20 that the percentage return on 
‘eurity (a) will be 30 greater than the amount indicated in Figure 10a; 
hile there is | chance in 22 that the percentage return on (b) will be 40 
iin than indicated by Figure 10b. It is assumed that, if security (a) 

higher than would be expected from its relationship with the index, it is 
Wither more nor less likely that the security (b) will be higher than expected 
from the index. In other words, the correlation between the two securities 


RE TS fon nda 
~0 -0 =) -0 0 


Figure 11. Deviations from expected. 


be explained in terms of their correlation with the index. It is not 
Wary for the analyst team to develop complete distributions such as 
in Figure 11, Only the variance, or standard deviation, of such 
butions is required for the derivation of covariances. 


Thom the above information the expected returns, the variances of 
iN), and the covariances of returns for all securities can be deduced. 
th analysis includes 100 securities, the analyst team must supply 


distribution as in Figure 9, 
10) sets of relationships between securities and index, as in Figure 10, 


1 variances for distributions of the sort illustrated in Figure 11. 


iMormation can be quickly processed by an electronic computer to 
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derive the 100 expected returns, the 100 variances, and the 4950 covariances: 
needed for the analysis. The derivation of expectations, variances, and 
covariances is based on formulae which are corollaries of the discussions 
this and the last chapter. These formulae are presented in the footnote. 
‘A portrayal of relationships, such as illustrated above, is usually refe 
to as a model, This sort of mathematical mode! is similar to a physi 
model in that it is a representation; it differs from a physical model in tl 
t is fashioned out of logical relationships rather than wood or clay. 
There are a number of sophistications which an analyst team might wist 
to add to our hypothetical model, for example: 


2. The relationships between the performance of a security and that of 
some (economy-wide or industry-wide) index need not be represented by a 
jiynight line. “Any curve which the analyst regards as a better representa- 
ion of the relationship may be used. 

1 The probability distribution in Figure 11a need not be the same in 
‘il circumstances. It can, for example, have greater variability when the 
Judes is high, less variability when the index is low. 


None of these modifications imposes a substantial burden on the elec- 
Wenie computer. They may, however, increase the requirements of the 


‘Wialyst team. After a point, complexity adds work without appreciably 
‘eteasing the value of the analysis. 


|, Rather than having one underlying factor, the general prosperity 
the market as expressed by some index, the analysis could have sever 
underlying factors. Securities could be classified into industries; indust 
prosperity could be explained by factors such as military expenditures ai 
consumer income. The correlation between securities within the sai 
industry would be explained in terms of their sharing in the indus 
prosperity. The correlation between securities in different industr 
would be explained by correlations between industry prosperities. 


EXERCISES 


1 Without referring to the text, write out the formula for the weighted sum of 
fandom variables, 


1} Using the variances and covariances in Table 5, determine the variance of 


4) 00 & portfolio consisting of 40% security 1, 30% security 2, 20% security 


2 The return on security (a) may be expressed as ‘sl 10% security 4. 


rem Ay t Al + 
where ry is the return on security (a), [is the value of the index, 4, and 4, are 
constants, and w is the random deviation illustrated in Figure 11. 
on security (b) may be expressed as 
r= By + Bl to. 

By assumption, cov(u,) = 0, cov(l, 1) =0, cov(l, v) = 0, expt(u) = 0, expt(e) 
The expected values of r, and ry are 

explt(r,) = Ay + Ay expt). 

expt(r,) = By + By expt). 
Subtracting their expectations from r, and ry, we get 


} Show that, if 4 and B are both positive, the correlation coefficient between 
snd Br is the same as that between g and r. 


4 Move that the correlation coefficient must be between ~1 and 1, inclusive, 

Start with the case in which both g and r have a variance equal to 1, 
vis case the covariance equals the correlation coefficient. Why’) Consider 
y  r)and var(g ~r). Can variance ever be negative? 


ANSWER 


+ 003. 
m= Al’ +m 
ry = Bl’ +0. 
The variance of re is 
exptteg)* = expt(dgl’ + uF = expla! + expt + 2 expl(sl'4) 
= (4g) var(/) + var(u). 
Similarly, 
var(r,) = (By)? var(l) + var(v). 
The covariance between r, and ry is 
cOV(ra, fa) = Expt Ay!’ + wu) Bgl” + 2) 
= expt(A,Byl”® + Ayl'o + Byl'u + 1) 
= A,B, varll). 


CnapTer V 


INVESTMENT IN LARGE NUMBERS 
OF SECURITIES 


INTRODUCTION 


In the last two chapters we presented formulae for computing { 
expected return and variance of return of a portfolio. These formul 
apply to portfolios of any size, to any pattern of correlations am« 
securities. It may be difficult, however, to see broad principles from 
cursory inspection of these formulae. The present section derives fri 
these formulae rules of thumb concerning portfolios containing lal 
numbers of securities. These rules of thumb provide insight into dos 
don'ts of large portfolios. 

First we discuss the properties of the average of a large number of 
correlated random variables. We see that diversification is extr 
powerful when outcomes are uncorrelated. This case, however, is 
applicable to the risks of insurance companies than to those of the 
investor, To understand the general properties of large portfolios we 
consider the averaging together of large numbers of highly cori 
outcomes. We find that diversification ts much less powerful in this 
Only a limited reduction in variability can be achieved by increasing 
number of securities in a portfolio. 

The discussion of this chapter emphasizes the importance of the 
variance. In portfolios involving large numbers of correlated secu 
variances shrink in importance compared to covariances. A 
adds much or little to the variability of a large portfolio, not accordi 
the size of its own variance, but according to the sum of all its covarii 
with the other securities of the portfolio. 


THe Sum oF THE First V NUMBERS 


It will be convenient in subsequent discussions for the reader to 
the formula for the sum 
1$243444°°°4N. 
102 
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‘The sum of the first NV numbers is equal to 

N-(N 41) 
2 


Fou 


mple, when N = 4, 
NOW +1) 


Sei0 
oh 


Shieh does in fact equal 1 +2+3+44. When N = 10, 
MN +1) _ 10-1 


2 Gagele 
Mehequals1+243+44+5+6+7+8+9+4+ 10. Itislesseasy 
}>eontirm, butequally true, that the sum of the numbers from 1 to 1000 is 


1000)(1001 
OOH 550 


Wool that the formula works for any N is provided in the footnote.t 


DARELATED RANDOM VARIABLES WITH IDENTICAL EXPECTED VALUES 
Hh VARiANces 


Ii the present section we assume that r,, ry. ry, © "ry are N uncorrelated 
lon variables. We also assume that each has the same expected 
le (£) and the same variance (V). Thus 

expt(r,) 
Vw var(ry) = varlrg) = var(rg) = 


Em expt(r, 


= expt(ry + = expt(ry), 


+ = var(ry). 


J We shall prove the formula by induction, That is, we shall show first that it is 
te N © 1, and then show that, if it is true for N = X, it must also be true for 
This establishes the formula for any N. 
fave of N= 1 is trivial: 1-2/2 is indeed the sum of the first one number. 
‘how that it has been shown that 


ce heen 
ee must have 


WADED a KH 1 ED 


+K4+ 


MK + 2K + 
z 


a Ki+3K+2 
2 
lesley 
Es) 
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1 
Also varie) = var [Fa] 
COV Fe) =0, — cOv(rs Ms) =O, COV» Fh) = 05 Cte i 
* 1)? 
We shall let s represent the sum of the V random variables: = (2) var(s) 
santntiat try. _ yar(s) 
5 aa 
We shall let w represent the average of the N random variables; tha N 
“Y, 
ntatntoty es 
ee NV 
N var(w) = <a 
or, equivalently, 
w= 5 
; rl the average (w) of N uncorrelated random variables, each with the 
Sometimes it is convenient to think of w as Variance V, itself has a variance equal to V divided by N. 
1 Above results concerning the expected value and variance of s and 
Wa ¥y be summarized as follows: 
: Mi Ifry ry rs.° © + ry, are N uncorrelated random variables generated 
The expected value of s follows from the formula (developed in Chapter Bosak cack withthe sane expected value Band variance V, Fe 
enpt(s) = expt(r) + expt) + expi(r) +> > + expt(ry) ‘T these random variables and w is their average, then 
= rl ae Di? expt(s) = NE, 
Ei =E+ E+ E+ + ere 
expt(s) = N° E. vars) = os ¥. 
The expected value of w is Maca 
u hat each of the rand iabl hi ted 
ae [to] we that each of the random variables r,, ry. r4 has an expec 
ep) SERPS Of and a variance of 1. If ws is the average of the first three 
1 variables—if w is the random variable equal to (r, + rz + 15)/3 
elit expt(s). q the expected value of ws is .3 and its variance is 1/3. If wg is the 
Of 10 of the random variables, its expected value is still .3 but its 
But expt(s) = NE, so § 1/10. Similarly the expected values of both wyo9 and Wyo99 are 
expt(w) = E. their variances are .01 and .001, respectively. As we consider aver- 
tod ‘Wf Wore and more of the r's, the expected values of the average stay 
Thus the average (x) of NV random saab exch eth ai aE but the variance of the average becomes increasingly close to zero. 
1 eet pais ie eel aeaieal rom oruiaer em ule, implicit in the theorem and illustrated by the example of the 
e variance of $ a 


raph, may be stated as follows: If wy is the average of N un- 
tel ‘fandom variables, each with the same variance and expected 
var(s) = var(r,) + var(ra) + var(rs) + °° * + var(ty) HN increases 
aVeVeVte tv Pietied value of wy remains constant, 
=N-V. ‘Swiltnce of wy approaches zero. 


Chapter IV. Since the random variables are uncorrelated, 
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ory. increasing diversification brings increasing certainty when returns are 
‘Wheorrelated and variances are identical. 

}. Variance of return is not the same for all securities. Suppose that 
the Yariance on the second security of a portfolio is greater than that on the 
fel, that the variance on the third security is greater than that on the 
weond; and that each addition introduces a security with still greater 
‘atinbility. What happens to the variance of the portfolio? It is possible 
Joy the variance of successive securities to increase so rapidly that the 
‘aflance of the portfolio does not approach zero. On the other hand, if 
th variances never exceed an upper limit, the variance of the portfolio 
Will approaches zero. We shall consider first the case in which variances 
‘Way within certain bounds. After this we consider a case in which vari- 
‘#hees increase without limit and the variance of the portfolio does not 
‘*ppioach zero. Even the latter case turns out to be a testimonial to the 
‘Power of diversification among uncorrelated outcomes. 

Wppose that 1,7, 73° > are uncorrelated returns with variances 

Vy Vc ++. Let s be the sum, and n' the average, of the first N of 

(je returns, Suppose that, while the variances V;, V,, Vs, ~~ are not 

— the same, none of them exceeds some upper bound V*. ¥, is 
than V*, Vp is less than ¥*, and so on. 

The variance of s is 


var(s) = V+ Vat Va+-°- + Vy. 
Wil this cannot be greater than 
VOR Vb Vp ES, 
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GENERALIZATIONS 


r, of the last section may be regarded as the amount won or lost 
dollar bet on a first random variable; r, similarly would be the amoun 
won or lost on a second random variable; ry, the amount won or lost o 
an Nth random variable. w then is the total amount won or lost if (1/¥) 
dollars are bet on each of N random variables. 

If N is large, the variance of w is almost zero. Under the conditior 
assumed in the previous section, wide diversification can practical 
eliminate uncertainty. 

There are four major ways, however, in which the returns from securiti 
difler from the random variables of the last section. We shalt disc 
these differences in increasing order of their effect on the results of the I 
section: 


1. Returns on securities are uncertain events rather than rand 
variables subject to known objective probabilities. This implies only t 
the expected returns, variances of returns, and covariances of retut 
referred to in this chapter should be interpreted as based on probabilil 
beliefs rather than on objective probabilities. 

2. Expected return is not the same for all securities. Let E, repre 
the expected value of the return r, on the first security; let £, repre 
the expected value of the return r, on the second security; and so 
Then the expected value of s is 


expt(s) = expt(r,) + expt(r) + expt(rs) + © + * + expt(ry) 


SE th+ ht s+ by. 
Wh equals N- ¥*. 
The expected value of the average w’ of the NV returns is Mm 


vars) << N°". 


expt(w) = i expi(s) variance of wis 


var(w) = (i) var. 


Eththt- st by 
N 


Ho) this can be no larger than 
The expected value of the average then equals the average of the ex; 
values. A portfolio with equal amounts invested in cach of N secur 
has an expected return per dollar invested equal to the average of 
expected returns of its securities. » 

The variance of return on a portfolio does not depend on the ex} 
returns of securities, Hence, as long as we assume that the r’s are 
correlated and have equal variances, we must conclude that the varial 
of return w approaches zero as N increases. Even though expected ret 


lt va 
1 yey 
(ernF 


: 
eres 


N increases, V*/N becomes smaller and smaller, approaching zero. 
ar(w) is smaller than V*/N, ittoo must approach zero as N increases. 
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Thus, if variances stay within limits—if they have an upper bound V* 
the variance of the average of the uncorrelated variables still approact 
zero. Ifreturns were uncorrelated and their variances bounded, suf 
diversification would bring virtual certainty of return, q 

Now let us consider a particular case in which variances increase witha 
limit. Suppose that the variance of the first security is V; that the vai 
of the second security is twice as large as that of the first security, Vv, = 21 
that the variance of the third security is three times as large as that of th 
first, Vj = 3V; similarly that V, = 4V, V; = SV, Ya = 6Y, and so 
As the number of securities increases, the variancx of the last one incr 
without bound. 5 

The variance of the sum s of W of such returns is 


var(s) = var(r,) + var(r,) + var(rs) + + * «+ var(ry) 
VEWS+IVt- + NV 
VI+24+34+°° +N) 


Hiong independent outcomes. A portfolio with variances ranging from 
¥ (0 100Y has a variance of little more than (1/2)¥. (Or, to translate this 
Wolo the more easily interpreted terms of standard deviation, the portfolio 
Sil) standard deviations up to 10 times as great as the standard deviation 
4; hus itself a standard deviation of 1/2, about .71 times the standard 
jation of r,.) As additional securities with still greater standard 
jations are added, the widening diversification prevents the variance 
the portfolio from increasing, keeping it about (1/2) 
Wi is theoretically possible for the variance of the r’s to increase so 
ly that the variance of w rises instead of falls. If, for example, the 
noes of the r’s doubled and redoubled, V, = V, ¥; = 2¥, Vy = 4, 
= AV, Vs = 16V, ¥_=32¥, and so on, then the variance of the 
Volio would increase without bound. 
‘Woh examples of rapidly increasing variances are of mostly academic 
fest If one ranked securities in general, or common stocks in 
lular, in order of increasing Variability, the variance of the 100th 
NW +1) Not be excessively greater than that of the 50th, the variance of the 
= oma would not be excessively greater than that of the 100th, nor would 
‘iiiance of even the 300th be excessively greater than that of the 200th. 
The variance of the average w of the N variables equals felurns from these securities were uncorrelated, there would be little 
Po ociated with widely diversified portfolios. But 
| The returns on securities are correlated, The consequences of these 
lations are discussed in the next section, 


var(w) = (5) vans 


Vv NN +1) 
mis 7) 
Ea cul Nuwoers oF CorretateD Returns 
But 
‘itiance of the average of a large number of future returns is closely 
10 the concept of an “average covariance.” We shall first deter- 
fence Jw many distinct covariances are associated with N variables and 


Wefine what we mean by the average covariance. We shall then be 
{u discuss the variance of portfolios consisting of large numbers of 
Mes 
number of distinct covariances, not counting variances and not 
 % as distinct from oy2, equals the sum of the first NV — I 
" 


i i Fe 
(N +1) becomes increasingly close to | as N increases. (1 
ample, starting with N = 1 we have (N + 1)/N = 2/1, 3/2, 4/3, 5/f 
101/100, + - -, 1001/1000,- +). As (W + 1)/N approaches 1, var( 
proaches V/2. When N = 100, for example, 


VH24340+-4(N—1), 


‘Sesple, if N= 2 there is one distinct covariance oy, If N= 3 
We 1 + 2 =3 distinct covariances, a1, 013, aNd Oy. If N= 
1 + 2 +3 = 6 distinct covariances oy, O19, O14, S29, Sess Fas: 
iH, /, has covariances with the N — | other securities, 


var(w) =. 1.01 = .505¥. 


In the present case the variance of the securities increases t00 
for diversification to make the variance of the portfolio approach 


This case, nevertheless, is an example of the power of diversi 19 F339 Tyas”) ay 
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ry has N — 2 additional covariances (not counting 4 listed above) 
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Fea» Typ * “sy Say 


Similarly ry has V — 3 covariances not previously counted; r, has N — 
additional covariances, and so on. Adding the covariances of r, to 
additional covariances of r,, to the additional covariances of r,, and so 


we obtain 
(N-)+(N—2)4+(V—-3)4-°-°4394241 


distinct covariances. The sum of the first N—1 numbers eq) 
(N — 1)Nj2. Therefore (N — 1)N/2 equals the number of disti 


covariances associated with N securities. 
The average covariance is defined to be 


the sum of all distinct covariances 
the number of distinct covariances 


That is, the average covariance equals 
the sum of all distinct covariances 
(N= DN 
Or, equivalently, the average covariance equals 


2- the sum of all distinct co 
(N— 1)N 


Conversely, the sum of all distinct covariances equals 


(N—)N 


+ the average covariance. 


In this section we shall assume that variances V,, Vs, Vs, °° * 
bounded: there is some number V* larger than any of them. 
In the last chapter we saw that the variance of the sum s equals 
var(s) = Vt Ha +My +0 - + Py 
+ 204, + 2oy3 + °° - + 24y 
+ 2oy +- > + 2ogy, ete. 


This may be summarized by the statement 


var(s) = (the sum of the variances) 


+ twice (the sum of all distinet covariances), 
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The average of the first N variables, 9° = s/N, has 
1\2 
var) = (4) var(s) 


_ the sum of the variances 
N? 
2+ the sum of all distinct covariances 
+ 5 7 
N’ 
Wut the sum of all distinct covariances equals (N — 1)N/2 times the 


‘Syeinge covariance, Therefore 


the sum of the variances | (N — 1)N 
,  -saitecaae erate 


a — + (the average covariance), 


adn) - (the average covariance) 


the sum of the variances 


‘The first of the two expressions on the right side of the above equation is 
the formula for the variance of w when returns are uncorrelated. We 
Hive seen that, if security variances have an upper bound, this sum of 
Jwilances divided by N* approaches zero as N increases, Furthermore, 
1¥ © W/N approaches | as N increases. Therefore, 


WN increases, the variance of w: approaches the value of the average 
raviance. 


Suppose. by way of illustration, that returns r,,ra.ra. + + all have the 
‘Wii Variance V and that all pairs have the same correlation coefficient 1/2. 
I) this case each covariance equals (1/2)V. The average of the covariances 
‘Wf {wo or more returns must also equal (1/2)7. When N = 100, 


vari) = OLY + 99[(1/2)¥] 
= (.505)¥, 


‘Shieh is only slightly greater than the average covariance. 
Compare this with the uncorrelated case: The variance of w is (1/2)V 
‘J i) the average of 100 uncorrelated returns, each with a variance of SOV. 
Tie variance of the correlated portfolio is the same as that of 100 inde- 
nies securities whose standard deviations are 7 times as great. If we 
faye together additional uncorrelated returns with variances equal to 
the variance of w falls toward zero. The variance of the correlated 
Winbles, on the other hand, can never fall below (1/2)¥, This is illus- 
Wied by Table 1, in which columns 2 and 3 present the variances and 
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standard deviations of the uncorrelated variables, while columns 4 and 
present those of the correlated variables. For the example of the tabl 


Jor example, if there were four securities in a portfolio, the contribution 
Wy the variance due to the first security would equal 


Ved 
1 
Tante | 7g lM + Aon + os + ow) 
DIVERSIFICATION AND CORRELATION ‘ i z 
Huppose we wished to select either security A or security B as the 100th 
‘Weurity of a portfolio. Each of the 99 securities already in the portfolio 
Number Standard Standard Jas the same variance V and each pair has a correlation coefficient of 1/2. 
Variance, si Variance, a 
of " Deviation, a Deviation, 
Bade Portfolio A : Portfolio B | potolio B 
urities Portfolio A ortfolio, Tame 2 
CORRELATIONS AMONG NINE SECURITIES 
1 5.000 2.236 100 316 — 
10 .500 107 055 235 Ne Me 
25 250 500 092 228 evrity mT. a'r |USS.| GM] gSe| CC. | Ban. | Frain. 
50 100 316 051 226 Mey] OU © 1%] oO] Mm | © 
100 080 204 0505 225 
250 020 Adi 0502 24 _— 1 
00 010 100 .0501 24 ) Ain, Tobacco | 
1,000 005 or 05005 24 ‘Am. Tel. & Tel. | 77 
10,000 0005 022 050005 2 US, Steel a | 33 
9 | 65 | 09 
AM, & Ste, wm] a9 | 43 | 47 
Portfolio A consists of (hypothetical) securities whose returns are indepen Coca-Cola 69 | dl 22 | 46 | 18 
and have variance equal to 5, Rorden 62 1m 2 40. 35 33 
Portfolio B consists of (hypothetical) securities whose returns have variai Firestone AS | 55 | 6t | 76 | 74} 38] 45 
Woaron Steel | 56 | 61 | si | 42 | 45 | 38 | 37 | 49 


equal to .1 and correlations equal to 1/2. 


_ 


THe CHOICE OF PORTFOLIO Average Correlation = 48. 


Table 2 presents historical correlations among the nine securities 
in Chapter II. The average of these correlations is .48, slightly less t 
1/2. This average may not be representative for all securities or | 
circumstances; but it does give us an idea, at least, of the magnitud 


‘Seurity A also has a variance of V and a correlation coefficient of 1/2 with 
securities in the portfolio, Security B, however, is much more specu- 

ive Its variance equals 25V; its standard deviation is 5 times as great 
}) that of security A. On the other hand, the return on security B is 


the correlations among security returns. ‘)iworrelated with any of the 99 securities already in the portfolio (perhaps 

Let us continue for a moment to consider portfolios with equal amou ‘eause the fate of this security depends, not on the swings of the general 
invested in each of N securities. The contribution which a parti ‘arhet, but on the success of some invention or the ruling of some court, 
security makes to the variance of such a portfolio—the amount by w ‘shone effect is substantially independent of the level of the market). The 


the variance of the portfolio would be reduced if the security in q 


eae ‘Wiiribution which the conservative security A would make to the variance 
were replaced by a completely riskless security—is equal to 


{he portfolio—the amount by which the variance of the portfolio would 

greater with A than with a riskless security—is 

V 4 (99 + (1/2)V) 
10,000 


2 
(4) [(the variance of the security) 


plus twice (the sum of all its distinct covariances) OV. 
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The contribution which the speculative security B would make is WWuline and deductive processes, leaving the analyst free to concentrate on 
processes involving induction and judgment. 

1 discharging his function of selecting securities to enter the analysis, 
ih analyst should keep in mind the properties of a security which may 
Wake it a worthwhile addition to a portfolio. Not only should promising 
yal) with respect to expected return and variance be considered, but also 
ihe value of low correlations should not be forgotten. 


The addition of the risky security produces a more conservative portfoli 
than the addition of the conservative security. 

This illustrates a basic principle: the security which is risky or con 
vative, appropriate or inappropriate, for one portfolio may be the oppos' 
for another. One must think of selecting a portfolio as a whole, ni 
securities per se. 

It would be incorrect to say that the variances or standard deviations 
securities haye little influence on the variance of a portfolio. True, t 
variance in itself adds little; it is the covariances which are importal 
The covariance, however, is related to both the correlation coefficient at 
the standard deviation. The covariance between r, and r,, for examy 
is equal to 


(the standard deviation of r,) times 
(the standard deviation of r,) times 
(their correlation coefficient). 


If security 1 has a standard deviation twice as large as that of security 
but security 2 has correlation coefficients twice as large as those of secut 
1, then each will make about the same contribution to the variance o} 
large portfolio. 

The returns on a security may be highly correlated with those of 
securities but not with those of others, The returns on another secul 
may have a different, perhaps a partially overlapping, pattern of hi 
low correlations. The problem is to pick securities so that their ave 
covariance is small. Since different amounts can be invested in diffe 
securities, the problem is really to choose securities and allocate funds 
that a weighted average of the covariances is small (where amoul 
invested determine the “weights”). In Part III of this monograph 
discuss techniques for minimizing variance fur various levels of ex 
return, These techniques permit an electronic computer to solve 
problem of finding the smallest weighted average of covariances consi 
with other objectives for, and restrictions upon, the choice of portfoli 
The computer supplements rather than supplants the human anal 
‘The analyst, or analyst team, must decide on the objectives of the anal 
select securities to be analyzed, decide on a model for derived covarial 
and make judgments pertaining to the expected returns, variance 
returns, and covariances of returns of securities. The machine takes 


17 
“sty the 


RETURN IN THE LONG RUN 


In general, if a portfolio provides returns equal to ry, rp, P3, °° 
Hisloric rate of growth of the portfolio equals 


VO FAME + +) (L+H) — 1 


This amount compounded annually for m years gives the same final value 


Cuapter VI 
Wf portfolio as did the returns ry, r° * +s ha: 


RETURN IN THE LONG RUN ‘ A bat 
Suppose that SI is wagered on the spin of the wheel in Figure 1. Let 7, 


Jepresent the gain or loss from this first spin. Suppose that the bettor now 


THe Geometric Mean 


Suppose that the return on a portfolio during eight consecutive yt 
was 


return | 15 |~.05] .20 | .00 |—.05)+.05| .00 | .10 


One dollar invested in the portfolio at the beginning of year 1 became $1. 
bythe end ofthe year. If this $1.15 were reinvested in the same porifolt 
it would have become $1.09 = (1.15): (.95) by the end of the second 
If reinvestment had continued, by the end of the third year the original 
would have increased to 


$1.31 = (1.15): (.95) + (1.20). 


Figure 1. A random wheel. 


gers his (+ ry) on a second spin of the wheel, winning or losing rz 
dollars per dollar bet. He now has S(1 + r,\(1 + 72), which he wagers. 
This continues for spins of the wheel, at which time his SI has become 


By the end of the eighth year the $1 would have increased to 
$1.44 = (1.15) - (.95) - (1.20) + (1.00) - (.95) - (1.05) - (1.00) - (1.10), 


$1.44 is not the same as the average return, 05, compounded annually 
cight years; (1.05)* equals 1.48—slightly greater than the value of 
portfolio after the eighth year. To find the rate of growth (the am 
which if compounded annually for cight years would equal the end 
of the portfolio) we must solve the equation 


rate of growth = V/(1.15)(95)1-20\1,00X.95)(1,05}(1.00)(1.10) — 1 


If the number of spins, n, is extremely large, it is virtually certain that 
j= 15 will have resulted from approximately 1/8 of the spins; r = .00 
Sill have resulted from 2/8 of the spins; and generally the relative fre- 
uency of each return will be close to its true probability. We say this is 
“yitually certain” rather than “certain’ because it is possible, for example, 
Jor y= .15 never to occur, even if m = 1,000,000. The probability that 
‘Wy (elative frequency will differ from its true probability by any specified 
smount (e.g., the probability that the relative frequency of r = .15 will 
ler from 1/8 by more than ,0001) approaches zero as n increases. Asn 
Herenses, it becomes increasingly certain that relative frequencies will be 
Wty close to true probabilities. 

If the relative frequencies of returns exactly equaled the probabilities of 


SU + AVL FAM toads (lt rh 


The expression ~V/(1.15)(.95\(1.20) - - - (1.10) is known as the g 
‘mean of the numbers (1.15), (95), ete. A practical method of eval 
this eighth root is noted in a subsequent section. 
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these returns, then the rate of return resulting from reinvesting for m spint 
of the wheel would equal 


Ope thousand observations! were used: ¢.g., about 83 years of monthly 
prin and losses, Let 2 represent the proportion of the individual's 

folio which is devoted to the security; (1 — 2) is devoted to cash. 
lable shows the performances over time of portfolios with 


VW(1-15)(95)(1-20)(1.00Y.95}(1.05(1.00K1.10) — 1. 


When 2 is extremely large, when frequencies are virtually certain to (OX = 30, 
close to probabilities, the experienced rate of return is almost certain to (2) X= .16, 
close to the above value, (3) X = 1.00, 

Tape | (4) X = 2.00. 


RESULTS OF SAMPLING EXPERIMENT Y = 2.00 represents the purchasing of the security with 0% “margin.” 
‘The investor borrows SI for every $1 of his own and invests both of these 


Wi the security. Of the four policies listed above, this one has the highest 


“Mont Strategi Strategies 
Mone if fisecst e ‘sapected return. On the other hand, it has the lowest geometric mean of 
(oy o's)| (D9 | @ | (3) | (4) (oy o's| | @ 3) t/ 41). This geometric mean, in fact, is less than 1. The investor who 
r= Sz = 76x = 1.0/2 = 20) z= Sx = 162 = 1.0) yyists in this policy is almost certain to end with less than his initial 
Biren. In our sampling experiment the “virtually certain” did in 

10 93 | 87 81 52 285) 3.72] 4.00 feet happen: The policy had highly variable returns in the short run and 


0) decidedly unprofitable in the long run. The SI initial investment 
soiled as 1/2 cent, 

Policy (3), with ¥ = 1,00, would have the greatest expected return if 
Wuying on margin were excluded. It has the second highest geometric 
jean of the four policies. In the sampling experiment it ended second 
twat after the 1000 random drawings. We see below that luck was with 
this policy—it ran a closer second than may be usually expected. 

Policy (2), with ¥ = .76, has the greatest geometric mean, At the end 
if the 1000 drawings it emerged best of the four policies. If in the 1000 
‘Wawings a return of —.2 appeared exactly 100 times, a return of —.1 
speared exactly 200 times, and so on, then X = .76 would have proved 
feiter than any other value of X. It rarely happens, however, that the 
Japected proportions are exactly realized. Usually a different value of ¥ 
i eX post, the most profitable. In the present sample X = 84 was the 
os! profitable in retrospect. As sample size increases, the observed 
frequencies “tend” to stay closer to the expected. Eventually the policy 
‘lich maximizes the geometric mean not only almost certainly does better 
than X = 2, but almost certainly does better than even XY = .84 (or 
Vm .77, of X = 75), 

Policy (1), with X = .5, has a lower geometric mean than policy (2). 
Ih is, however, less variable than the latter. One might prefer (1) to (2), 
wwerificing return in the long run for stability of return in the short run. 


200 
2 | 112] 1.16 | 118 | 1.04 | 250 | 286] 3.45] 3.29 
30 | 1.51 | 203 | 242 | 3.77 | 300 | 3.79] 5.03] 5.05 
40 | 1.95] 260 | 326 | 552] 350 | 9.53] 19.19] 27.36 
$0 | 1.83 | 232 | 276 | 3.42 | 400 | 11.98 25.66 | 37.49 
60 | 1.72} 210 | 239 | 235 | 480 | 672} 10.02] 10.02 
1 | 145] 158 | 162 | 85 | soo | 7.54] 1114] 10.58 
80 | 183) 223 | 249 | 1.80 | 550 | 890] 1338] 12.37 
9 | 226} 310 | 3.77 | 3.40 | 600 | 14.26| 25.59] 26.90 
100 | 264] 380 | aa | 543] 650 | 7.17/ gaa] S70 
MO | 201} 245 | 2.67 | 1.26 | 700 | 10.44] 1407} 10.41 
120 | 217] 271 | 298 | 1.27 | 750 [11.76] 15.78] 11.19 
130 | 236 | 305 | 342 | 1.47 | goo | 14.43) 19.95] 13.83 
M40 | 270] 3.69 | 4.35 | 212 | 950 | 2528] 43.64] 35.45 
1530 | 240 | 3.04 | 331 | 1.03 | 900 | 44.66) 97.42] 94.63 
950 | 43.38 | 87.18 | 75.41 
1000 | $4.60 | 116.98 | 103.54 


Table 1 shows the results of an illustrative sampling experiment. 
suppose that the investor can place his funds in either cash or a particul 
security. The probability distribution of returns on the security are 


return | —.2| -.1] 0 | +.1] +.2 


probability} 1 | 2 | 3 | 3 | a1 


* One thousand random digits were interpreted as follows: a random digit = 0 
Wlieated 2 —.2 return; a random digit = | or 2 indicated a —.1 return, etc, 
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THE MEAN VALUE OF THE LOGARITHM. 


Logarithms can be used to facilitate the calculation of the geor 
mean. The concept of the logarithm can also help us understand 
approximate relationship among the mean, the variance, and 
geomeiric mean. 

‘Two types of logarithms are in general use. The common logarithm 
constructed so that the number 10 has a logarithm equal to 1. 
natural logarithm is constructed so that the number e, approxima' 
2.7183, has a logarithm equal to 1. The natural logarithm of a number 
always equal to approximately 2.3026 times the common logarithm of 
number. We shall let log(A) represent the common logarithm of 4; 
shall let Ln(A) represent the natural logarithm of A. 

Both the common logarithm and the natural logarithm have the folle 
ing extremely important properties: 


1. The logarithm of A times B equals the logarithm of A plus 
logarithm of B. More generally, the logarithm of A,:A,° 43° °° 
equals the logarithm of A, plus the logarithm of A, plus the logarithm 
Ay, etc, 

2. The logarithm of A* equals 1 times the logarithm of A. 


3. The logarithm of V/A equals the logarithm of 4 divided by n. 


To caleulate ¥/(1.15)(.95)(1.20}(I.00).95)(1.05)(1.00)(1.10) by using 
mon logarithms we consult one of the published tables of logarithms to 
that 


log (1.15) = 061, 10g (1.00) = .000, 
log ( .95) = —.022, log (1.05) = .021, 
log (1.20) = .079, log (1.10) = .041. 


From this it follows that the logarithm of (1.15\.95)(1.20)(1.00)(.95)(1. 
(1.00)(1.10) equals 
.061 — .022 + .079 + ,000 — .022 + .021 + .000 + .041 = .158. 

The logarithm of ¥/(1.15)(.95)(1.20) «+ « (1-10) equals .158/8 = 0; 
Once again consulting a table of logarithms, we find that the anti-lo 
.02—the number whose logarithm is .02—equals 1,047, Thus we 
found, by addition and division rather than by the taking of an eighth 
that the rate of growth on the portfolio was 4.7% per annum. 

In general, the common logarithm of the geometric mean of the num 
(htm (i trh- > (+ 7) equals 

Jog(] + 74) + log(l + ry) +> * > + log(l + r,) 
n 


This is either the average or the expected value of log(l +7) de- 
fering on the nature of ». The natural logarithm of the geometric 
fa) of (1 +7), similarly, equals the average or expected value of 


ball +7). 


Hp of 1 +7) minus 1. 


Wurnatic ArPRoxiMaTions 


For returns (r), from about a 30% loss ( = —.3) to about a 30% gain 
+3), the natural logarithm of 1 + ris approximately equal to 


Hyiside the range from—.3 to + .3 the approximation becomes increasingly 
Table 2 compares the approximation r — (1/2)r? with the actual 


poor 


COMPARISON OF La(l +r) AND r— (1/2)r? 


v= (1f2yr%, 


Taste 2 
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()/, to put it another way, the rate of growths the anti-log of the average 


Lri(l +r) r= (1/2)? 


—.20 


+.00 
+210 
+.20 
+30 
+40 
+.50 


Seles of Ln(1+r), The approximation r — (1/28, like any weighted 


sum of rand , is referred to as a quadratic. 


1 Lod +r) is approximately r — (1/2)r*, then the expected value of 
Jot) +r) is approximately 


expt(r) — (1/2) expt(r?). 
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It is shown in the footnote’ that 
expt(7?) = var(r) + (expt(r)]?. 
Thus the expected value of Ln(1 +r) may be approximated by 
expt(r) — (1/2) var(r) — (1/2) lexpt(r). 


“r —(1/2)r is an approximation “centered” around r= 0. 
similar approximation centered around r = E, where £ = expt(r), is 


eB l@— Ef 


Ln(l + £) + THE 308% 


The expected value of the above expression is 


Ln(l +6) = 3 


Both the first approximation, 
E — 5 briny) + £4), 


and the second, 

1 var(r) 
Ln(l + £)—5 a+ 
allow us to estimate expt(Ln(1 + r)], and hence the rate of growth of 

portfolio, from its expected return and variance. 
The accuracy of these estimates in the case of past returns is illust 
by Table 3, which presents the expected return, standard devial 
Ln(1 + r), and the values of the two approximations for the nine secut 
of Chapter I, It presents similar information for portfolios P and 
Portfolio P maximizes E — (1/2)[var(r) + E%). Portfolio Qis the 
portfolio with greatest Ln(I + £) — [(1/2) var(ry(I + EVI. 
Columns 5 and 7 of the table show the ratio of approximate to 
for each of our approximations. The approximation E — (1/2)(a* + 
shows a marked downward bias. In every case the approximation 
less than the actual. In one case (security 8) the approximation exhil 


1 varlr) = expt{r — expt(r)}® 
= expt(r*) — 2 expt{r- expt(r)) + exptfexpt(r)*) 
= expt(r’) — 2 expt(r) - expt(r) + fexpt(r)? 
= expt(rt) — fexpt(rrP. 
Hence 
‘expt(r?) = var(r) + Texpt(r)F. 
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TWO APPROXIMATIONS TO E£ log (1 +7) 
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an error of 20% In two other cases it had an error in excess of 10! 
The approximation Ln(1 + £) — {(1/2) var(r)(1 + £)'] fared better, It 
largest error in the cases examined was 5.1% (security 9). It was partici 
larly accurate in cases with small standard deviations. Its error was I 
than 1.5% for the four securities with the lowest standard deviati 
(securities 1, 2, 6, and 7 with standard deviations of .231, .121, .203, ai 
-169 respectively). 

In Figure 2, Ln(| +r) is plotted against log(] + £) — {(1/2) var( 
(1 + £)*] for the securities and portfolios considered. There seems 10 


16 


log(+ £) — [50/014 £)?] 
8 5 5 & 


z 


02 


04 06 «08 
#Llog(1+r) 


10 12 1616 


Figure 2. Quadratic approximation to E log(1 + 7). 


an upward bias in the region of high Ln(I +r), but as a whole 
approximation seems good. 

Using the approximation Ln(1 + E) — {(1/2) var(r)/(1 + E)'] we 
estimate the rate of growth to be expected in the long run from const 
reinvesting in portfolios with expected return E and variance of ret 
var(r). For example, if £ = .1 and var(r) = .04, then the expected val 
of the natural logarithm of (1 + r) is approximately 


Ln(1.1) — {(1/2)(.04)/(1.1)%] = .095 — .017 = .078. 


The anti-log of .078 is 1.08. The rate of growth over the long run 
be expected to be about 8% per annum. 


q 
{im return for short-run stability, 
‘Thapter II the portfolio with greatest average logarithm had an average 


is 
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The combination of expected return and variance which promises the 
lest return in the long run is not necessarily the combination which 
meets the investor's needs. The investor may prefer to sacrifice 
In the nine-security example of 


RETURN IN THE LONG RUN 


Win of approximately .188 and standard deviation of return of approxi- 
ely 292. If the reader will locate this point on the curve in Figure 4, 
pler II, he will probably find that, if these means and standard devia- 
Wis reflected probability beliefs, he would prefer a more stable portfolio 
i that which provides greatest long-run potential. 
Although it is not the sole consideration in the selection of a portfolio, 
of growth is not without interest. The rate of growth associated with 
tfolio—the rate of growth to be expected in the long run from always 
esting in portfolios with the mean and variance the portfolio in 
estion has currently—can be estimatad by the formula of this section. 


PART Il 


EFFICIENT PORTFOLIOS, 


Cuarter VII 


GEOMETRIC ANALYSIS OF EFFICIENT SETS 


|NUCTION 


A portfolio is inefficient if it is possible to obtain higher expected (or 
ge) return with no greater variability of return, or obtain greater 
nly of return with no less average or expected return. The problem 
‘separating efficient from inefficient portfolios, when standard deviation 
‘Wiliance is used as a measure of uncertainty, is treated in the present 
ipier, in the following chapter, and in Appendix A. The present 
lef presents a geometric analysis of portfolios containing three or four 
ities. The following chapter presents computing procedures for 
ing efficient portfolios from the means, variances, and covariances 
‘iy number of securities. Appendix A shows that the portfolios 
jueed by these procedures are, in fact, efficient. 
Foy readers who prefer to omit purely computational discussions, the 
chapter presents a general description of the concepts and rela- 
Nips used in deriving efficient portfolios. For readers who wish to 
we further the problems and techniques of computation, the present 
ipier presents concrete examples to illustrate the general discussions of 
pier VINE and Appendix A. 
Will not be possible to prove all relationships asserted in this chapter. 
Have supplied such proofs would have required a more extensive 
ition of analytic geometry and calculus than seems appropriate here. 
sader who is familiar with these branches of mathematics should be 
jo confirm most assertions himself, Since it is still true for the 
Wis of this chapter, as it was for the preceding ones, that an under- 
ing, of its proof is essential to the understanding of a theorem, we can 
Hope to communicate information about, rather than an understand- 
‘of, the topics covered, 
# shall consider first analyses involving three securities; then analyses 
ving four securities. The four-security analysis introduces all the 
129 
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‘This forbids negative investments (short sales), Thus in the standard 
folio analysis a portfolio is considered legitimate if, and only if, its 
ind X, are each equal to or greater than zero and the sum Y, + 
X, equals |. Any portfolio represented by a point below the X, 
‘His in Figure I is not legitimate, since it violates the condition X, > 0. 
portfolio represented by a point to the left of the X, axis is not 
imate, since it violates the condition X,>0. Any portfolio repre- 
by @ point above the line labeled X, = 1 — X, — X, = 0 is not 


main concepts needed for the analysis of portfolios involving any num! 
of securities. 


Tue Set of LeGitimaTe PortFouios 


In the three-security analysis of the present and following sections 
shall let 


X, represent the fraction of the portfolio invested in a first security; 

X, represent the fraction of the portfolio invested in a second secur 
and 

X, represent the fraction of the portfolio invested in a third security, 


If, for example, 


then 75% of the value of the portfolio is in security 1; 25% in security 3, 


X, = 1.00, 
X= 0, x x 
X= .00, 
the entire portfolio is invested in security 1. . Geometric representation Figure 2. A non-standard set of Iegiti- 
In a three-security analysis uilolios, mate portfolios. 


Xt Xt X=. 


‘The fraction invested in a third security follows from the fractions in) 
in the other securities: 


limate, since it violates the condition X,>0. A portfolio is legiti- 

Hi, therefore, if'and only if it lies on or in the triangle abc. The area on 
in this triangle is referred to as the set of legitimate portfolios. 

‘The requirement that X, > Qis a restriction on the choice of portfolio. 

festrictions are usually referred to as constraints. The standard 

Holio analysis has four constraints: 


Xy=1- Yh 


A portfolio may be represented geometrically as in Figure 1. 
horizontal axis of the graph indicates the value of X;. The vertical 
indicates the value of X; The value of X; is given implicitly 
relationship X20, ¥, 20, ¥,>0, and 


X= 1-¥,-¥,. M+ %+ My = 1. 
The point labeled P, for example, represents the portfolio with X 
X, = .50, and X, = 1 — 25 — 50 = .25. 


The standard portfolio analysis requires that 


Sonstraints determine the set of legitimate portfolios, The set of 
imate portfolios is a geometric representation of the constraints, The 
lyri N-security analysis has N + 1 constraints. In the standard 


x20, ‘Weurity analysis, for example, the six constraints are 
eee X20, % 20, %20, % 50, X,>0, and 
X20. 


Mth t Ut ut X=. 


ar isa 
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The constraints of a portfolio analysis need not be those of the stan 
analysis. Suppose that, as in the example security analysis based 
probability beliefs in Chapter II, there is a minimum requirement on 
current “income” of the portfolio. Suppose that the current incomes 
the three securities are .04, .02, and .03, respectively, and the i 
requirement for the portfolio is.03. A portfolio, then, is legitimate onl 


04%, + 02%, + .03X, > 03. 


Henceforth: 


, will represent the covariance between the returns of the ith and jth 
\tieS; 9, for example, represents the covariance between the first 
second securities, as in previous chapters. 
#, will represent the mean (average or expected) return on the ith 
Wily, (wis the small Greek letter mu, pronounced myoo.) 

Y, will continue to represent the fraction of the portfolio invested in the 
seeurity. 

J will represent the expected (or average) return on the portfolio as a 


Substituting X, = 1 — X, — X,, we may express the requirement as 
O1X, — 01%, >0. 


To be legitimate the portfolio must be represented by a point below the 
labeled .01%, — 01%, = 0 in Figure 2, Legitimate portfolios, in 
present case, are represented by points in and on the triangle abd. 
The computing procedures presented in the next chapter apply wi 
constraints have one or more of the following forms: MEAN Lines 


A\Xy + Ag + Ns +> > + AX = B 


¥ will represent the variance of return on the portfolio as a whole, 


Ii a three-security analysis the expected return E on the portfolios as a 


or 
AX, + AXy + Ag t+ + AyXy SB E= Xypy + Nolte + Xatta- 
or Be 2 
AX AN OAL an ae Aye felationship expresses the. results of Chapter III in the notation of the 
Ay Ag ES) : tion, It says that the expected (or average) return on the portfolio 
where Ay, 4g, Ay,* * +, Ay and B represent numbers. An example ‘} Weighted average of the expected (or average) returns of its securities. 


Weights involved are the amounts invested in each security. 
Il we substitute | — X, — X, for X; in the above equation, we obtain 


E= Xyy + Xo + (1 1 — Xa 
= Xa = Ha) + Xa(te = Ha) + Hee 


first type of constraint is the standard 


Mt yt tot Ay= 


An example of the second type of constraint is the standard 


x20. us 
E = X(t, — fa) + X a(t — a) + Hs 


i*s |he expected value of the portfolio to the amounts invested in X, 
XY) Suppose that 


Anexample of the third type of constraint is the non-standard requi 
x, < 05. 


The computing procedures of the next chapter apply to analyses in 
any number of such constraints.” 


yy = 10, pg =.05, and p, = .07. 
W\ the expected return on a portfolio would be 
E = .03X, — 02%, + 07. 
}} porifotios with expected return equal to 08 must satisfy the equation 
O8 = 03, — .02¥, + .07, 


| The computing procedure is based on the assumption that X, > 0. Ifwe 
allow short sales in security 1, but keep the convention that X, > 0, we can 
amount invested in security 1 equal the difference between two variables: 


y=4-%, 


where Z; is the amount purchased and ¥, is the amount sold short. Both Zy 


must be non-negative, Ol = 03, — 02%). 
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© /4y = Hay the iso-mean lines form a system of parallel lines, with 2 
tion of increasing E, as illustrated in Figure 3. 

If 4 = #2 = uy all portfolios have the same expected return. In this 
zero) does equal (.01). Conversely, X, = 2/3 and ¥, = 1/2 satisfy the only efficient portfolio is that (or those) with smallest variance. 
equation; therefore the point with X, = 2/3 and X, = 1/2 is on the li itherwise stated we shall assume that the u's are not the same. 

A portfolio has an expected return equal to .08 if, and only if, itis 1h Figure 3 we have included the triangle on and in which are contained 
sented by a point on the line labeled E = .08. legitimate portfolios for the standard three security analysis. The iso- 
i lines are not confined to this triangle. An expected return can be 
(bed to a portfolio even though it is not legitimate for a particular 
lynis. 
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This equation is represented by the line labeled £ = .08 in Figure 3. 
point with X, = 1/3 and X, = Ois on the line; therefore X, = 1/3 
Xq = 0 satisfy the above equation; that is (03 times 1/3) minus (02 ti 


VARIANCE CURVES 


‘The variance of a three-security porifolio may be expressed as 
Vow Xion + Xfon2 + Xtos + 2X, Xe + IX Xan + 2XXon 
ituting 1 — X, — X, for X5, we obtain 
Vow Xho + Xho + aas(l — X, — XQ? + 2XXatie 
+ 20,3X,(1 — X, — %,) + 2oyX(1 — % - XY), 
\, upon expanding squares and rearranging terms, becomes 
V = Xilouy — 2043 + 5g) + XGl0 — 2005 + Oa) 
+ 2K, Xylo, — oy — Fy + On) 
+ 2Xi[o1s — 3) + 2Xsl0ea — Fal + Ons- 


expresses V in terms of X; and X,, For three securities with 


Oy = Oy = 01, 


Figure 3. Iso-mean lines. 


oy = .04, 
Oy, = .005, and 


All portfolios with expected return equal to .06 must, similarly, 
On = On = 0, 


X, and X, which satisfy the equation 
=.01 = 03%, — 02%. variance of the portfolio equals 
Vom XYL01 + .04] + X9L01 + .04) + 24, %2[.005 + .04) 
+ 2Xy[—.04] + 24[— 04] + .04 
= OSX} + .05X} + 09%, X, — 08%, — .08X, + 04. 
perifolios with variance equal to .01 must satisfy the equation ] 
Ol = OSE + 05XE + .09%,¥, — 08%, — .08X, + .04; | 


This equation is represented by the line labeled £ = .06. The line is 
locus of all points representing portfolios with £ = .06. Figure 3 
presents lines which are the loci of all points representing portfolios 
expected returns equal to .05, .07, .09, and .10. 

Such loci of points with the same expected return will be referred 
iso-mean lines, The arrow in Figure 3 indicates the direction of in 
expected return, As we move in this direction we find iso-mean 


associated with increasing expected returns. As long as we do not Ws 


OSX} + OSX} + .09X,X, — 08%, — 08%, + 03 = 0. 
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The curve marked V = .01 in Figure 4 is the locus of all points satis 
the above equation. This iso-variance curve is an ellipse. (An ellipse 
be roughly described as a flattened circle, the circle itself being a 3 

case of an ellipse. The paths of the planets about the sun are exampl 


ellipses.) Figure 4 also presents iso-variance ellipses for V = .02, V = 


and V = .04, All these ellipses have the same center, the same oriental 
and the same ratio of longest diameter to shortest diameter. The 


variance curve for V = .01 is contained within the iso-variance curve 


%| 


Figure 4, Iso-variance curves. 


V =.02, This in turn is contained in the iso-variance curve for ¥ 
As V increases, its iso-variance curve expands without changing its 
shape, its center, and its orientation, 

The point labeled ¢, with X; = X, = 8/19, is the center of the 5} 
iso-variance ellipses. The variance, .006, of the portfolio represet 
the point cis smaller than that for any other legitimate or non-lej 
portfolio. There is no iso-variance curve for V less than .006, 
portfolio has a variance this small. ‘There is, on the other hand, 
variance curve, centered at c, for any V greater than .006. 

Typically the iso-variance curves for any set of three securities 
system of ellipses as illustrated above. Each ellipse of the system 
same shape, orientation, and center. The center ¢ represents the 
mate or non-legitimate portfolio with smallest variance. ¢ may {i 
or outside the set of legitimate portfolios. There is an iso-varianet 
associated with every value of variance greater than that of ¢. AS 


innee ellipses are presented in Figures Sa through Sd. 
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cs, the iso-variance ellipse expands. Examples of systems of iso- 


Figure 5. Systems of iso-variance ellipses. 


\\o-variance curves are not systems of ellipses if, and only if, one or 
Af the following conditions hold: 


1% = 205 + 045 = 0; that is, the random variable (r, — r5) has 
variance, 

Hy = 20,5 + 45 = 0; that is, the random variable (r, — r5) has 
varia or 

the random variables (r; —r3) and (r_— rs) have a correlation 
Jent equal to either +1 or —1, 


Wi condition occurs either when o,, and ayy are both zero, or when 


E q 


- 
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lk Ws we move from V, to V, to Vz, It rises again as we move from 
( V,to V,. Of all points on the E; iso-mean line the point b, at which 
line touches the Vy iso-variance ellipse, has the smallest variance, 
y \so-variance line contained within the V, curve does not touch the E, 
Any iso-variance curve which surrounds the V, curve has a higher 
lane than Vy. 

AL b the ¥, curve is tangent to the E, line: it touches but does not 
the £, line. All other iso-variance curves either do not touch the 
AL all or else cross it twice. Only the ¥, curve is tangent to the £, 


ry and rs are perfectly correlated. The second condition occurs u 
analogous circumstances. In the present chapter we shall assume 
none of the three conditions occurs; hence the iso-variance curves 
always ellipses. (The correctness of the calculations described in Chaj 
VIIL does not depend on these assumptions. The assumptions 
however, simplify the geometry of the analysis.) 


THE CRITICAL LINE FOR THREE SECURITIES 
1) any other iso-mean line the point with the smallest variance is 
jlarly the point at which the line is tangent to an iso-variance curve. 
i) the point d has smaller variance than any other on the line £ = Ey; 
point c has smaller variance than any other on the line £ = E,; the 
Wi ¢ has smaller variance than any other on the line E = £,. The 
i / similarly represents the portfolio which gives smaller variance than 
other, legitimate or non-legitimate, portfolio with expected return 
to Ey, 

‘The line labeled // is the locus of all tangencies between iso-mean lines 
Wsovariance ellipses. It is the locus of all points which minimize 
Jihee among portfolios with the same expected return. We refer to it 
the critical line. Ifa point is on the critical line, it minimizes variance 
some value of expected return. Conversely, if a point minimizes 
ice for some value of expected return, it is on the critical line. 

‘The critical line is @ straight line rather than a curve. It always passes 
lugh ¢, the center of the iso-variance lines. The critical line may or 
iy NOL pass through the set of legitimate portfolios. 


Figure 6 contains both iso-mean lines and iso-variance ellipses. 
line labeled E = E, is the locus of points representing portfolios 
expected returns equal to F, The lines labeled E = Ey, E = Ey 
similarly, loci of points representing portfolios with other ex} 


WNT PORTFOLIOS 


Pure 7 includes the following information concerning a standard 
lysis of three securities: 


Figure 6. The critical line, 


1)) the point c representing the portfolio with smaller variance than any 
{ portfolio, legitimate or non-legitimate; in the present example c is 
imate 

+) the point ¥ representing the portfolio with greater expected return 
i Huy other legitimate portfolio; 

1)) the direction of increasing E; and 

1) the critical line, //. 


Teturns. The ellipse labeled V = Vj is the locus of points repr 
portfolios with variance equal to V,. Those labeled V = V,, V 
and V = V, are other iso-variance ellipses. 

Imagine starting at the point a and moving along the line I 
E= E, in the direction indicated by the arrow connected to the 
As we thus move along this iso-mean line we successively meet t 
variance curves V= Vy V=Vy, V=V, Vo Vy Va My 
‘expected return is the same at every point along the line. The vai 


thwll argue that the above information implies that the heavy line, 
'F lepresents the set of efficient portfolios. That is, we shall argue that 
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1) no portfolio can have smaller variance, so a fortiori no portfolio can 
Smaller variance with the same or greater expected return. 
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a portfolio is efficient if, and only if, it is represented by a point on 
heavy line. 
To be efficient a portfolio P must meet the following three conditi 


efore c is efficient. 
No portfolio with an expected return smaller than that of c—no port- 
fepresented by a point above the iso-mean line E = E—can be 
font, All such portfolios have a lower expected return and a higher 
ice than that of c. In particular, if we move along the critical 
from c in the direction of decreasing E, we do not reach efficient 
ollos, 

1 the other hand, if we move along the critical line from ¢ in the 
tion of increasing £, we do encounter efficient porifolios. Consider 
portfolio represented by a point P on the critical line between c and a. 

8 portfolio is efficient since 


{1) P is a legitimate portfolio; 

(2) if any legitimate portfolio has a greater expected return, it must 
have a greater variance of return than the portfolio P; and 

(3) if any legitimate portfolio has a smaller variance of return, it 
also have a smaller expected return than the portfolio P. 


An inefficient portfolio meets the first condition but fails to meet eithe! 
second or the third condition. A portfolio which fails to meet the 


1) iL is legitimate; 

) if'a portfolio has greater expected return it must lie below the iso- 
i) line which passes through P; but the iso-variance curve passing 
i P, and all iso-variance curves inside this one, lie above the iso- 
i) line passing through P; hence any portfolio with a higher expected 
| must have a higher variance of return; and 

1)) if portfolio has a smaller variance than is associated with P, then 
just be represented by a point inside the iso-variance ellipse which 
through P; but all such points are above the iso-mean line passing 
igh P; hence any portfolio with a smaller variance must also have a 
lier expected return than P. 


‘shy point P on the critical line between c and a is efficient. 
Tie points on the critical line below @ cannot be efficient since they are 
legitimate. The critical line and the set of efficient portfolios must 
Ways at this point. 
141 Ws consider any iso-mean line (E = E*) which passes between a and 
‘The non-legitimate point b represents a portfolio with smaller variance 
that of any other point on this line. As we move from b, variance 
es, The point Q, on the boundary of the set of legitimate port- 
. is closer to 6 than any other legitimate point on E = E*. Thus the 
olio represented by Q has a smaller variance than any other legitimate 
oll with expected return equal to E*. The portfolio represented by 
‘illafles the three criteria of efficiency: 


Figure 7, The set of efficient portfolios. 


condition is neither efficient nor inefficient, but simply non-legitima 
is possible for a portfolio to meet conditions | and 2 but fail to 
condition 3. It is also possible for a portfolio to meet conditions | 
but fail to meet condition 2. All three conditions, therefore, mi 
confirmed to ensure efficiency. 

The portfolio represented by the point ¢ meets the three conditi 
efficiency: 


(1) it is legitimate; 

(2) it has a smaller variance than any other portfolio, so no le 
portfolio can have a greater expected return with the same or 
variance; and 


1) iis legitimate; and 
J) all points with cither greater expected return and no greater variance 
We variance and no less expected return—the cross-hatched area A in 
¢ tare not legitimate. 
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jent, Points within the cross-hatched area B represent portfolios with 
) less variance and greater expected return than those of a typical 
mate point Z not on the heavy bent line. (Why is the portfolio 
ented by the point Y inefficient ?) 
gure 9 illustrates a case in which c is not legitimate. The legitimate 
X with smallest variance is that at which the iso-variance curve 
) » V) is tangent to the X, axis. This point is efficient, as are all points 
wen it and the intersection of the X, axis and the critical line. The 
Nt of the critical line which lies within the legitimate set is efficient; 
Ws the segment of the X, axis from 6 to ¥. 
igure 10 illustrates a case in which 4, = Mg > #, and c is within the 
Of Iegitimate portfolios, The set of efficient portfolios is simply 
‘egnent of the critical line from ¢ toa. 6 represents a portfolio 
h fulfills conditions 1 and 2 but fails to meet condition 3 for 
ney. 
pire 11 illustrates a case in which the legitimate point X with minimum 
H/siwe is the same as the point ¥ with maximum expected return. In 
fave the set of efficient portfolios consists of exactly one point, X = ¥. 


% 


Figure, 8. Efficient and inefficient port- Figure 9. Efficient portfolio 
folios. 


Thus any portfolio represented by a point Q between a and ¥ is effi 
The portfolio represented by ¥, with expected return greater than tl 
any other legitimate portfolio, is itself efficient. ‘ 

The portfolios represented by the points on the heavy line in Figi 
are therefore efficient. Figure & illustrates why other portfolios af 


4, 


Figure 12. Efficient portfolios. 


Figure 12 illustrates a case in which the critical line does not intersect 
#1 of legitimate portfolios. One interesting feature of this case is that 


Figure 10. Efficient portfolios. Figure 11. Wi of efficient portfolios seems to reverse directions at the point a, 
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points in and on the tetrahedron whose corners, a, b, ¢, and d, represent 


The reader will note, however, that it always moves in the directio 
portfolios with, respectively, 100% invested in security 1, 100% 


increasing return as it moves from X to ¥. Another interesting featu 
this case will be noted later in connection with the relationship bet 


the expected return and variance of return of efficient portfolios. % 


t 


Figure 14, Standard legitimate set for four securities, 


ed in security 2, 100% invested in security 3, and 100% invested in 


Figure 13, Efficient portfolios. 


{Ivis section we define and label various subsets of portfolios. These 
I), oF subspaces as we shall call them, are defined without regard to 
Wequalities 


Figure 13 illustrates a case with a non-standard set of legitimate 
folios, polygon abcdef. The set of efficient portfolios is the heal 


Xgh¥. 
KH 20 ASO yyS0 eS 0: 


THE STANDARD FouR-SECURITY ANALYSIS 
Hill, however, consider only portfolios which satisfy the equation 


Certain features of the many-securities analysis, not apparent 
three-security analysis, appear when we add a fourth security. 
few sections discuss efficient portfolios for the standard four-securi 
This discussion illustrates concepts crucial to the analyses, in Chal 
and Appendix A, of standard and non-standard cases involvil 
number of securities. 

Just as the three-security case required only two-dimensional gi 
the four-security case requires only three-dimensional geometry. 
portfolio can be identified by Xj, Xq, and Xy, since X,= 1 — 
Xs. In Figure 14, X; is plotted along one axis; %; along anotl 
X, along the third. The standard set of legitimate portfolios is re 


Ht hye yt Kal. 


jdcedure will first be to present the definitions of, and notation for, 
‘particular subspaces, and then discuss the general definition and 
{iN Applicable to all these subspaces. The purpose of these distinc- 
‘#i notation will become clear in the two following sections. 

Ny subspace s,,2.4 consists of all points (portfolios) which satisfy the 
1g (WO conditions: 


(I) X= 0, 
Q M+ ++ X%,=1. 
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The absence of subscripts 1 and 3 indicates that 5,4 contains only 
ils with X, = X; = 0. Geometrically s,,, is the X, axis of Figure 14. 
fontains only points with X, = X, = 0. It is represented, geometric- 
iy, by the X, axis. 

4) Contains only points with X,= X,=0. Geometrically it is the 
Passing through the points a and 6 of Figure 14, Similarly 5,3 is the 
fussing through @ and ¢; sp.3 is the line passing through the points 
ant 


The subspace s, consists of all points satisfying the conditions: 
%=0, %y=0 %=0, 
M+ ++ X= 1 


The portfolio with 
X,=1/2, %= 12, X,=0, and X= 0 


is in the subspace 5,,2,, since it meets the two conditions: Xs does 
zero, and X, + X_+ Xy + X, does equal 1, The portfolio with 


% =3/4, Xp=3/4, Xy=0, and X= —I/2 


is also in s,,4.: nothing in the definition of s,,,., rules out negative ¥ 
of X's. The portfolio with 


X= 14, Xp= 1/4, Ky= lA, and X= 1/4 


is not in 5,,2¢ since it violates the condition that X, = 0. Geomet 
Sia is the plane in Figure 14 which contains both the X, axis and 
X, axis. Any point on this plane has X = 0; any point with Xy = 
‘on this plane. The presence of subscripts 1, 2, and 4 indicates 
X, Xp, and X, vary within 5,.».q subject only to the constraint 


X + M+ Xs) + X= | 


The absence of a subscript 3 indicates that X, = 0 for all points i 
subspace 5; 9.4: 
The subspace 5,5,4 consists of all points with 

% =0, 

Yt ht UHtUal 
The absence of the subscript 2 reflects the fact that all points in 5), 
X,— 0. The presence of the subscripts 1, 3, and 4 indicates 
Ys, and, vary within 5,9. Geometrically the subspace 5.34 
plane containing both the X, axis and the X, axis. 

The subspace s;,3,4 consists of all points with X, = 0, and Xy + 
Xy+X,= 1. Geometrically 254 is the plane which conta 
Ny and X, axes. 

The subspace 5,25 contains all points with X%,=0 and X, + 
Xy+ X%y= 1. Geometrically 5,23 is the plane which passes throi 
points a, 6, c in Figure 14. 

The subspace 5,,, is the set of all points satisfying the constraint 

X%,=0, X= 0, 
Xt t+ yt X=. 


the point a in Figure 14 is the only one which satisfies these four 
Hints. 5, is the set containing only the one point, a. 5, 53, and 5, 
milarly, subspaces containing one point each. 

Subspace 5, 2.5.4 consists of all points with 


Kt y+ %yt X= 


fmence of all subscripts indicates that none of the X"s is required to 
| wero. Geometrically 5,934 consists of all points which can be 
ytd in Figure 14 (ignoring the limitations of the page). These 
already incorporate the constraint X, + X, + X,+ X,=1. 

ipeneral, we distinguish 15 subspaces which we divide into four groups: 


I Maan 

Pita Sine Soe haa 

Ha Na Stas Seas Saas $3.45 
HH, Ss 53, Sy. 


Subspace consists of points satisfying two types of constraints: first, 
A+ Xs + X= 1; second, if the number i does not appear as 2 
ipl of y, then X, = 0. 

444!) feferred to as a three-dimensional subspace, since it consists of 
i in the three-dimensional Figure 14. The subspaces in group B 
#ilimensional, since cach is represented by a plane. The subspaces 
if © are one-dimensional, since they are represented by lines. The 
§ in group D are referred to as zero-dimensional, since they 
Of only one point each. 


The absence of the subscripts 3 and 4 indicates that X, and X, 
for any point in s,. Geometrically 5, , is the X, axis in Figure I 
points on this line have ¥, = X, = 0. The points on this line dil 
in the amount invested in X, and ¥, = 1 — %. 


A Lines 


HW three-security analysis the critical line played a crucial role in 
Hop efficient from inefficient portfolios. In a four-security analysis 
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several critical lines are needed to serve the same purpose, For 
subspace distinguished in the last section we define a critical set. In 
cases the critical set is a straight line; in some cases the critical set 
single point. We keep track of critical sets by means of a notation si 
to that used for subspaces. For example, 


h the critical set associated with the subspace 5, 2,5, 
|..z.; is the critical set associated with the subspace s, 
|g is the critical set associated with the subspace 5;,; 
1, is the critical set associated with the subspace 5. 


ial to. 1, there is one with smallest variance. The point representing this 
folio is on 25.4. Of all portfolios with expected return equal to .2, 

is one with smallest variance. The point representing this portfolio 

OW Iai 25 are the points which give smallest variance for any other 

| of expected return. 

is thus, by definition, the locus of all points P which satisfy the 

ing conditions: 


11) Pisin sya.n, (ie, it has % +X, + ¥, +X, =1); 
12) of all portfolios in s, ».¢ with the same expected return as P, P has 


Critical sets associated with other subspaces are similarly denoted. {mnallest variance 


We shall first define the critical set /, 93,4 for the three-dimensi 
«- Next we shall define the critical set for a represent 
al subspace; then for a representative one-dimen: 
subspace; and finally for a zero-dimensional subspace. The relatiot 
between critical sets and efficient portfolios is discussed in the next 


definition applies whether the u's are all equal or not. When all p's 
‘ual, only one value of P satisfigs the above criteria. When the y's 
Hot all equal, the set of points satisfying the above criteria form a 
ight line, In the typical case, therefore, /, 23.4 is a straight line rep- 
Hing those portfolios which minimize variance for various values of E. 
Implicit in the discussions of the present and next section Is an assill ‘The definition of 4,5 is similar to that of /y2s,, except that /y...5 is 
tion concerning variances and covariances. This assumption isa ied exclusively in terms of the points of the subspace 5,95. A point 
zation of those made in the three-security case to ensure that the 110 (2.9 if it meets the following conditions: 
variance curves were ellipses. In the four-security analysis the assum 
ensures that the iso-variance sets are ellipsoids—flattened sphere 
three-dimensional generalization of the ellipse. [Technically we a! 
that the covariance matrix of (r; — ra), (te — rely and (ry — 1%) 18 
singular. This assumption is used every time we presuppose that 
is one and only one portfolio with smallest variance among certain 
portfolios which arise in our discussion of the geometry of 
sets. The assumption implicit in the present discussion is not ¢: 
to the computing procedures presented in the next chapter. 
The nature of /y.».3,4 depends on whether 


1)) Pis in the subspace 5, and 


1) of all points in 5,» with the same expected return as P, P has the 
lest variance. 


» in other words, is the locus of points in 5,2. which minimizes 
lance for various levels of expected return, If 44, = py = fy, hho.s is 
point—that which represents the portfolio with'smallest variance. 
fy and ps are not all equal, /, 2. is a straight line. exactly the 
As the critical line of a three-security analysis involving X,, X_, X3. 
‘©f add a fourth security to a three-security analysis we add new critical 
OMI ae eon aerate bot the critical line 43 stays the same. 

(2) at least two ys have different values. critical sets, Ny e.as Ay as 4.9.4 associated with the other two-dimen- 

| subspaces are defined similarly and have similar properties. 

‘The definition of the critical set 4. for the one-dimensional subspace 
4) formally the same as the definitions of critical sets for higher- 
fisional subspaces. A point Pisin /, , if, and only if, it satisfies two 
lions: 


In the first case all securities, and therefore all portfolios, have the 
expected return. 4.9.3.4, in this case, is a single point, that representi 
legitimate or non-legitimate portfolio with smallest variance. I 
second case, when two p’s are different, it is possible to find portfoli 
any level of expected return, They may not be legitimate; they 
represented by points beyond the printed page. Still it is possible t 
values of X,, Xy Xa Xy = 1 — X, — Yo — Xy which satisfy the eq 


B= 1X1 + Hak, + MyXa + oaXe 
for any value of expected return E, Of all portfolios with expected 


1) Pisin s,9: and 

1) of all P in s,.» with the same expected return as P, P has the smallest, 
lane. 

is itself a straight line, /, » is either a single point or else is the same 
fy = ay, then gis a point. If, * up, then /, » is the same as 
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Not every legitimate portfolio represented by a point on a critical 
is efficient. 
‘The relationship between critical lines and efficient portfolios can be 
Vibed by a simple device. Imagine the critical lines as railroad tracks, 
yjine, further, that a passenger boards a train at ¥ and travels in the 
ition of increasing expected return. The first time his train crosses 
iver track—as soon as the first critical line intersects a second critical 
the passenger transfers to the new track, the new critical line, and 
i (tavels in the direction of increasing E. Again he reaches another 
A, an intersection of critical lines, and again he transfers to the new, 
inuing in the direction of increasing £. This continues until the 
Niger reaches ¥ with maximum £. There his journey ends. 
Tyery point through which the traycler passes on his journey represents 
fMeient portfolio, Conversely, every efficient portfolio was reached 
foute, In Figure 15, for example, the set of efficient portfolios is 
sented by the heavy bent line that starts at ¥. moves along /.o.o4 
|| (his intersects /, » 3, moves along the latter until it reaches /, 5, moves 
i {yy until it finally reaches ¥. 
How do we know that the traveler always remains within the set of 
limate portfolios? How do we know he passes only through points 
senting efficient portfolios? How do we know that he will ever 
ve ai 22" Unless we can answer these questions satisfactorily we can 
fo confidence in the computing techniques of the next section. The 
jane that the traveler's route has the properties ascribed to it is 
ted in Appendix A. 


5,2. Similar remarks apply to the critical set of any other one-dimensi 
subspace. 

The critical set /, is exactly the same as the subspace s,. The crit 
sets fy, fy and /, are similarly the same as the subspaces 5, 53, al 
respectively. 

Figure 15 shows a system of critical sets for a four-security anal 
The relationship between the critical lines and the set of efficient portfe 
also shown in Figure 15, is discussed in the next section. Critical 


xy] Ethcrent pettolos —= 


Figure 15. Critical lines and efficient portfolios. 


one- and zero-dimensional subspaces could have been defined f 
three-dimensional analysis of previous sections. In that analysis 
proliferation of critical lines would have added ittle. In traci 
efficient sets for many securities, however, critical sets for one- and 
dimensional subspaces play an important role. The geometric de 
of the present chapter, in particular, is simplified by the notion of e 
lines for one-dimensional subspaces. The computing procedures 
next chapter require the notion for both one- and zero-di 
subspaces. 


HPOLIO ANALYSES INVOLVING ANY NUMBER OF SECURITIES 


basic features of analyses involving large numbers of securities are 
ily found in the four-security analysis. These basic features are 


1)) the definitions of, and notations for, subspaces; 

J) the definitions of, and notations for, critical lines: 

)) the tracing of the set of efficient portfolios from X to & along critical 
q 


EFFIctenT PorTFOLIOS as 
‘ie are, say, seven securities altogether in an analysis, then the sub- 


4syu5.7 Consists of all portfolios with 
Mt Met wt Mt yt Xt %, 
X,=0, and k= 0 


We shall let X represent the legitimate portfolio with variance 
than that of any other portfolio, X is to be distinguished from 
portfolio with variance smaller than any other portfolio, legitiy 
non-legitimate. If c is legitimate, then ¥ =e. If c is not legi 
eo 

Every efficient portfolio is on some critical line. The converse 


) Stunlly the usual procedure is to trace the efficient set from #10 ¥. In this case 
Seeiiion is how can we be sure that the computation will finally arrive at ¥. 
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fwint with expected return E, and variance Vg. As he continues along 
(witical line, he passes through a point with expected return £, and 
Jance V,. The point at which he transfers to the next critical line has 
and M4 for expected return and variance. If he had continued along 
‘itical line beyond the region in which it was efficient, he would have 
hed a point with expected return E, and variance of return Vs. Along. 
ew critical line a new parabola describes the relationship between 
aid V 
The set of points representing efficient portfolios turns a corner, forms a 
i) kink, as our passenger transfers from one critical line to another. 
es typically no such kink, however, in the curve describing the 
ion between E and V forefficient portfolios. The point a in Figure 17 
rates the typical relationship between their parabolas at the inter- 
jon of two successive critical lines of the efficient set. The parabolas 
Jangent: they touch but do nbt cross. The relationship between V 
moves in a straight line, then turns and moves in another straight Ff \ransfers from one parabola to the other without discontinuity 
then turns again and moves in another straight line, continuing un ink 
reaches a point with maximum £. The path thus traced is a cont 1 our traveler was on the critical line corresponding to parabola 2 for 
series of straight line segments. 4 than Ey, he finds that this critical line intersects that of parabola | 
As long as our passenger moves along a single critical line, the rel f » Ey and it becomes non-legitimate for E greater than Eo. He is 
ship between expected return and variance of return is represented i, therefore, to transfer critical lines. If, on the other hand, our 
ger was on the critical line corresponding to parabola | for E less 
he finds that the critical line corresponding to parabola 2 goes 
being non-legitimate to being legitimate at E= £,. To stay with 
Jnl portfolios he must transfer to this new and better critical line at Eq. 
¥ 1) is, however, possible for the curve relating efficient V to efficient E to 
kink. This occurs when the set of efficient portfolios turns corner a 


The critical set |, 5.4.5.7 consists of all points P such that 


(1) Pisin sya453 and 
(2) of all points in s,.5,4,s,7 With the same expected return as P, P has 
smallest variance. 


With the various critical sets (mostly lines) defined similarly, the 
efficient portfolios is traced as before, by starting with Y and movi 
the direction of increasing £, transferring from one critical line to ant 
whenever these intersect, until 2 is reached. { 

The reader may find it instructive to apply this general procedure to 
three-security examples of Figures 7 and 9 through 12. 


Erricient ComBINaTIONs: £ AND V, E AND STANDARD DEVIATION 


In tracing the set of efficient portfolios, our imaginary passenger 


Figuie 12. Whenever 2 kink occurs, it must be of this nature _/) 
¥ than of this nature . 


curve relating efficient standard deviation to efficient expected 
4) has properties similar to the curve relating E and V. The chief 
nce is that the former is principally made of segments of hyperbolas 
y than parabolas, If the analysis includes a security with zero 
wiee, One segment of the curve relating efficient standard deviation to 
Wel expected return is a straight line, as illustrated by the segment 
Wing at (E, c) = (0,0) in Figure 4 of Chapter II. The curve relating 
/\t standard deviation and efficient expected return has a kink, if, 
only if, the E, V curve has one, 


E 


BREE 


Figure 16, V and Ealong acritical line. 


Figure 17, Relationship 
consecutive efficient 


parabola, such as that in Figure 16. The solid segment of the pi 
corresponds to the segment of the critical line which is efficient. 
broken portion of the parabola corresponds to the remainder of the 
line. When our passenger first reaches the particular critical line, 
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SUMMATION SIGN 


1s commonly employed to denote summation. Thus 


2 
3 x= 
A 
Carrer VIII iy the same thing as 


Mt Met Mtoe t Xe 


Js the Greek capital letter sigma, not to be confused with the small 
M0. The expression 
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INTRODUCTION 


Wy be read as “the sum of X;, for i from I to N,” or as “sigma X; for i 


This chapter presents the “critical line method” for deriving el 
J to N.” Here are some other examples involving E: 


portfolios. This method, based on principles illustrated in the 
chapter, processes the means, variances, and covariances of any nul 
of securities. From this information it obtains the implied 
portfolios. 

Several early sections of this chapter are devoted to matrix alj 
Although it is possible to present the critical line method wil 
reference to matrices, a discussion of this subject is included for 
reasons: 


. 
Y Xm= £ 
rs 

# compact way of saying 


Nyy + Natta + Xgby to + Xyay = E 


the three-security case we may write 


1. The required principles of vectors and matrices are not di 
learn, even for the reader with a meager mathematical background, 

2. A much greater insight into the rationale of the computing pri 
is possible for the reader familiar with matrices. 

3. If the reader is to pursue problems of computation, if he wis! 
Jook into the closely related field of linear programming, for exam) 
knowledge of matrix algebra is invaluable. 


3 
2 Xm E 
¢ than 
Kye + Kota + Xyea = E. 


Weller under © indicates what is to be summed. Thus 


x 
2X, 
Notions such as the addition, subtraction, and “scalar multiplicati gnu 

matrices are easily grasped, since they closely parallel ordinary 
tions of arithmetic. “Row by column multiplication” of mati 
less natural but a little practice gets the reader over this hurd! 
notion of an inverse and its modification, which may present a 
hurdle, is essential for an understanding of much of the com| 


OX + 92Me + oaks to + + Oy Xy 


forticular when i = Land N= 3 


Xs = OX + Xe + 9X3. 


procedure 

The concepts and notation of the sections up to and including t 
“Addition, Scalar Multiplication’” are used in later chapters. The 
who decides to omit or postpone discussions of computing procedi 
advised to read these preliminary sections before going on to the 
chapter. 


sis the contribution made to the variance of a portfolio by the first of 
securities.) When the nature of the summation is clear from context, 
Ws permissible to write, for example, 


x 
TX pu, oreven Y Xu instead of ¥ Xu. 
a 
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yoy XN 
$F $.x,Xjoy represents a sum of sums (Sx 
ieiget fei 
Kon XXi0u XXn 
wa + + + 
DY DHX os = | KMart | XeXe02e | + | H%2700|- 
ne + e + 
XX, XXs0n, XX 3035, 
Or, equivalently, 


ue XX + Wo + Mtn 
LEM = + MX + MeN + Veto 
aie + XX + YX + XoXses 


The general relationship between the variance of a portfolio and 
covariances (including the variances) of securities may be expressed as 


wk 
Vad ¥ XX 0. 


ist jat 


When the nature of the summation is clear from context, we may wri 


yy 
TINK instead of YY XXoy 


ieijat 


Matrices, VECTORS 


‘A matrix is a rectangular array of numbers enclosed by brackets, 


example: 


691 
[ } 
483 


000 8 -! 0 
00 Ol, [1,6,-4,1) Si 0, ie 
00 0. a ietds 0 


Ifa matrix has m rows and n columns, we say that it is an m by m mal 


Thus the matrices listed above are, respectively, 2 by 2, 3 by 1, 2) 
Ly 1, 3 by 3, 1 by 4, and 3 by 3. The numbers within a matrix are 


its elements. 
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Im =n the matrix is said to be square. The first, fourth, fifth, and 
Wyenth of the above matrices are square. If = | the matrix is called a 
tolunn vector. The second and fourth matrices are column vectors. If 
| the matrix is called a row vector. The fourth and sixth matrices 
46 1ow vectors. 

In ordinary arithmetic the operations of addition, subtraction, multi- 
Pilcation, and division “combine” two numbers to “produce” a third 
juinber In matrix algebra there are standard operations which combine 
two matrices to produce a third matrix. These matrix operations include 
Wilition, subtraction, and “scalar” multiplication (discussed in the 
Weereding section); and matrix multiplication (discussed in the next 
Heeeeding section), 


ApbiTioN, SCALAR MULTIPLICATION 


Suppose that A and B represent two matrices, each with m rows and n 
wolumns. Then 
CsAt+B 


Sani by nmatrix. Each component of Cis the sum of the corresponding 
ements of A and B: if a,;, b,;, and c,, are the elements of the ith row and 
Mh column of 4, B, and C, respectively, then 


Cy = Ay + iy. 


[rnb +L} 


The difference between two m by n matrices D = A — Bis anm by a 
Watrix each of whose components is the difference of the corresponding 
Himponents of A and B. Thus, if d,, is the element in the ith row, jth 


ile 


We shall discuss two kinds of multiplication involving matrices. The 
We called scalar multiplication, is discussed in this section. The second, 
soiled matrix multiplication, is discussed in the next section. 

Mf A is an m by nm matrix with elements a,,, and c is any number, then 


Pxamece: 


Pearce: 


B=ca 
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is an m by n matrix whose elements are 
by, = cay 


cA is referred to as the “scalar product” of ¢ times A. 


ExAampLes: 
iE l= lt 
4 =2. a | 
—t =3 1 3 
ie nl--b zh 
EXERCISES 
1, Compute 


“TL EE «EU 
“eG = of 


2. A matrix all of whose components equal zero is called a zero matt 
Thus the 2 by 3 zero matrix is 
000 
000 


A=At+0 
where O represents a zero matrix with the same number of rows and columny 
the matrix 4, 
3. Show that, as with numbers, A + B= B+ A. 
4, We have defined matrix addition only for pairs of matrices, Show tl 
as with numbers, 


Prove that, as with numbers, 


{4+ B]+ C=A+ (B+ C). 
Thus we may write 
AtBHC. 
5. Show that, as with numbers, if 


C=A-B 
then 
A=CH+B 
6. If 0 is the number zero and 0 is an m by m zero matrix, show that 
04 =0 
for any m by n matrix A. 
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7, Show that, as with numbers, 
A-—B=A+(-1)B. 


4. Show that, as with numbers, 
(A + B)= cA + cB, 


Shite 4 and B are m by m matrices and c is a number. 


ANSWERS 
i} 5 0 2 
(a) [ (b) | ©) | -6}; 
3 0. 2 
-! 18 2 
(d) | i} (e) [*} @) | 4]. 
=} 6. 2 


Matrix Muctiptication 


The definition of a matrix product, A+ B or AB, is not so straight- 
lorward as the definitions of sums and differences. A special case of a 
‘aW"ix product is the product of a row vector and a column vector. The 
Jule for forming such a product is illustrated by the following example: 

2 
4 

U,3,5,7) f= 12434 45°64 7-8 = 100, 
8 


more generally, 
by 


by 
{ay. ay, a, a4) pare by + gba + agby + aby. 


by 
Fis, in general, if 
by 


a=([q,---,a,) and b=|- |, 
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} 2 by 2matrix. In general, if the ith row of A is (aj, Oy.° * * @,), and 


then their product is defined to be 
We jth column of B is 


a-b = Sab, = aby + ayby + °° + bye 


b, 
Such a “matrix product” of a row vector by a column vector is referred the 
as an inner product. The reason for, and the usefulness of, this definiti ., 
will emerge subsequently. 
The “inner product” is defined only for the multiplication of a row ai a 
acolumn vector with the same number of elements. The matrix prodi te 


A» B, is in general defined only when A has the same number of colut 
as Bhas rows, If A isan m by r matrix, and Bisanr byn matrix, t 
ABis an m by n matrix, For example, 


ihyen the clement in ith row, jth column of AB is 


Arby + Gaba +o > °F aide 
Thus, if C = AB, c,, is the inner product of the ith row of A and jth column 
Wf. The restriction that the number of columns of A equal the number 
‘of rows of B is required for this inner product to be defined. Note that, 
wilike ordinary arithmetic, AB does not necessarily equal BA. For 


bh 
Pw-b4 


Two matrices are “equal” if and only if each element of one is equal to 


the corresponding element of the other. Thus, given our definition of 
Wulliplication, 


a3 by 4 matrix times a 4 by 6 matrix produces a 3 by 6 matrix; 
a | by S matrix times a 5 by 1 matrix produces a 1 by | matrix; 
a5 by | matrix times a 1 by 5 matrix produces a 5 by 5 matrix; 
a 4 by 4 matrix times a 4 by 4 matrix produces a 4 by 4 matrix. 


Multiplication is not defined for 


a3 by 2 matrix times a 4 by 6 matrix, 
a 5 by | matrix times a 5 by | matrix; 
a 3 by 3 matrix times a 4 by 4 matrix. 


Thus the number of columns in A must equal the number of rows i 
for AB to be defined. AB has the same number of rows as 4, and 
same number of columns as B. 

The definition of A - B is illustrated by the following examples: 


13) [2 4) [t-243-6, 1443-8 
fF sal Waleed pee 
20 28), 

-[ i 
24) fl 3) p2-dt4es, 2344-7 
i iG leleataa Aa 


22 27 
46 74) 


3.4:117%) [1 

28 6||%|=|9 

19 4jlyJ Le 
Weans that 

3444+ %=1, 
i) 2%, + 8%, + 6X = 9, 

Ny + 9X, + 4Xy = 2. 
Hf we tet 


cit 1 
For any 2 by 3 matrix A and 3 by 2 matrix B, A * B equals A=|2 8 61, b=/91, 
ibis Li 9 4. 


then the equations (1) may be written 


Byrd 
iezase Wt + Aiabar + Aabsy Audie + aiePes + 


bub | = 
tuna) | | Lenbint textes + Obs Onde + bes + 
AX=b. 
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¥ RG 
If Y= Gi x= » 
Yom. xX, 


and A is an m by n matrix, then 
Y=xAX 


expresses the Y; as weighted sums (linear combinations) of the X;. 


EXERCISES 


1, Confirm that 


pay 416 10 
() [: -1 [i ‘ J-[: 20 « 
2 afl 4 8 32 20, 
1 Beith 32 
(b) 3/B,12—=] 9 3 6); 
6. 18 6 12 


1 
© B, raf = [18). 
6. 
2. Compute 
1 2 100 
286 
o fish} o Riders 
i 2 2 4. oo 1 
3, Show that z 
40 = 0, 


where 0 represents a zero matrix. 
4, Suppose that Y= AX as above; suppose that 
2 
zZe=- 
zZ, 
is defined by a p by m matrix B, as follows: 
z= BY. 
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Filher show in general or confirm for some special case that 
2 = (BA)X, 
whwie (8A) is the p by n product of A and B. 
§ What three equations in three unknowns are represented by 


347 
(XY Xe nfs 8 ‘| = (1,9, 2). 
| He at 
Note: The answer is not equations (1) of the text. 


4 Either show in general, or confirm in some example, that (AB)C = A(BC) 
What is, the product of A and B multiplied by C equals A times the product of 
Haid C). Because of this relationship we may write ABC without ambiguity. 


ANSWERS 


38 20 6 
2” lia Al Wy i 4 i 


3%,+2%,+ X=; 
AX, + 8X, + 9X, = 9: 
X, + 6X_ + 4Xy=2. 


‘Thansrose, Symmetry, MAIN DIAGONAL 


A’ represents the transpose of A. A’ is the matsix which has the same 
tows as A has columns, has the same columns as A has rows. For 
Hrample, 


,2,4°=|21], 
6 
ez 

18 9) 
=|8 4], 

ea 
9 6] 
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‘A unit matrix is frequently represented by the symbol 7. The reader 
should confirm that, whenever the sizes of the matrices are such that 
Wultiplication is defined, 


A matrix is symmetric if A = A’; for example, 


Hy 3 3 
if A= then A’ = ; 
2 2 3) 2 


hence A is symmetric. On the other hand, 


i [ ‘] 
if A= then A’ = 3 
oe 34 


hence in this case A is not symmetric since 


Noes tz 
# E 

24 364 
Only a square matrix can be symmetric. 
The main diagonal of a matrix consists of the elements 44), @aa, say 


The elements of the main diagonals of the following two matrices 
circled: 


Al =A, 
IA=A, 
Thus the product of any matrix A, multiplied (on the left or right) by a 


Wit matrix, is A itself. The matrix / plays a role in matrix algebra 
similar to that played by the number | in ordinary arithmetic, 


As the reader may confirm, 
10071 0 o} flood 
13 off-13 13 0}=]o 1 0]; 
0 2 1JL73 -23 1] lo o 


! 0 ofl 0 0 100 


iQ. 1378), 14 
2671 —1/3 1/3 O}]1 3 0} =JO 1 OF. 
9. 3 ay ale 243° 2/3 10 2 1 ool 
1680 Whenever two such square matrices have the property that 
ro Aiealeus AB = BA =I, 
3 @8 6 we say that Bis the inverse of A (and, conversely, A is the “inverse” of B). 
i Wa square matrix A has an inverse at all, it has only one inverse 
L9 6 @ 4 {Proof Suppose that both B and C were inverses of A. Since CA = J 
wid AB = J, we have (CA)B = B and C(AB) hence C = B.) 
pees The symbol A- is frequently used to represent the “inverse of A.” 


Hf we know A-!, we can easily solve a set of equations 
1. Show that for any matrix 4 the main diagonal of A is the same as the {!) AY=b. 
diagonal of 4’, 

Multiplying (1) on the left by 4-1, we get 


‘THE INVERSE AOAY = Ab, 
‘A square matrix with “ones” along its main diagonal and “zi Y=A7, 
elsewhere is called a unit matrix. Thus the 3 by 3 unit matrix is ‘ 
Yor example, the solution to 
100 
10 OX, 1 
oO 1 0}. 
1 3 O||x%|=|-2 
oo1 


o 2 14Lx%,. 0 
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is 
iN; tte Oe MO paul 1 
%}=|-13 3 off -2|=]-1}. 
% 23-43 JL of L-2 
If A is an m by m matrix which has an inverse, then the solution to 
bn 1 
; 0 
A =|0 
bal LO 


is the first columns of A~!, Let £, be an m element column vector 
"in the ith place and zeros everywhere else; for example, if m = 


1 0 0 
Bei) Chet) &=|0l 
0. 0. 1 
Then the solution to 
bu 
A =£, 
b, 


is the ith column of the inverse. Not all square matrices have in) 
For example, the matrix 

eee 

eT, 


has no inverse since it is impossible to solve either 
2.210% Th k Ala (i 
= or =| |. 
1 Ly, 0. 1t 1 
14 
00 


has no inverse since it is impossible to solve 


[ le]-C) 
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The following properties of inverses are noted without proof. 


| A square matrix A has an inverse if and only if there is no vector 


% 
X=}: |, 
Xm, 
Miher than the vector 
07 
M=leh 
0 
Hoh that 
0 
AX= 
0) 


2A square matrix A has an inverse if and only if there is no vector 
Ye (%.' ++ Yq other than ¥, = (0, - -, 0), such that 


YA =0. 
), If A and B are m by m matrices such that 


AB =I, 
then we also have 
BA =I. 


4, A matrix has an inverse if and only if its “determinant” is not zero. 


A square matrix which does not have an inverse is called singular. A 
huirix which does have an inverse is called non-singular. 


EXeRcIsES 
|, Suppose that A = [a] isa 1 by 1 matrix. What is its inverse? When is a 
{by 1 matrix singular? Does the rule 
if AX=6 then X= A" 
ayply 10 1 by 1 matrices? 
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2. Construct a singular and a non-singular 2 by 2 matrix, 


3. Use 
10 oy? x, =1, 
130] tosove X,+3X,=1, 
O21 


21X,+ Xy=1. 


The vector 


MobIFviNG AN INVERSE 


Ih satisfies Ay = 
Suppose A is an m by m matrix with an inverse A-!; suppose that B ie attics Ati aay, be exptesicdae 


m by m with an inverse B-!, Then ABis an m by m matrix whose y= rp 
is B-~ A“ (note the change in order). This is easily proved: 
(note the chee? i ora) ibe a cay With y thus defined we may express 4 as 
(AB)B“1A-!) = A(BB-)A-! = AA = 1. : 
OM 
Suppose that A is an m by m matrix, 1 oh 
Cs Laan lb 
Os F ; 
amd: on =AE, 
Gypy” © *) Tum, Ye 
with an inverse Bud 
Aphanen 
At= . . 
0 Yon i 


where E, as indicated above, is a matrix whose elements ¢,, satisly 
Suppose that 4 is a matrix which is the same as A except that the 


if j =k, 
column of A has been replaced by the vector Se 
A ey= 41 ifies#k, 
h 
C 0 otherwise. 
pq 
The inverse of Eis the m by m matrix 
on m. 1 —nlYe 9 
Ct cs Ce — vl 
d= = u 
a ‘ve F 
Dare? Am rats Bs Gaara” * "Ooms ; 


Suppose that 4 also has an inverse (d)-1. There is a simple fe 


which relates the coefficients of A-! and 4-1, This relation will o Yn Ye 1 


169 


Welul in our £, V computing procedures, It may be derived as follows: 
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whose elements é,, satisfy 
—yile  iff=kbutitk, 


Mye if i= kandj =k, 
a= 
ala ifi=j#k, 

0 otherwise. 


This may be shown by confirming that 
EE" = FE =I. 


(Left for the reader.) Applying the rule for the inverse of a product, 


find that 
(Ay! = (AEY? = EA. 


Hence, as the reader should confirm, the coefficients a” of (4)-” satis 


fa —(yilyya? if xk, 
Hi 
7 ety fori =k, 


EXERCISES 


1, Show that 
(ABC)? = CAB A+, 


2. What is this relationship between 2” and a’ if the kth row, rather than) 


‘kth column, is modified? 


THE PORTFOLIO SELECTION PROBLEM 


A portfolio is represented by a vector 
x 


X= 
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Where X, is the fraction invested in the jth security. We shall assume 
X,20 for all j. 
The X, are also subject to constraints: 
OX, + ayXy + aysXy ts + 4,K, = by, 
Oy X, + OX, + dyyXy ++ * + OanXK, = dy, 
An X, + Opp Xy + Img Xy + °° * 
Tp the standard case m = 1, 
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Oy = Oy = Oy =' + =O, = 1. 
That is, the only constraint other than X, > 0 is 


Y= 1. 
‘Ai Inequality such as 
04K, + 02%, + 03%, > .03 


San be expressed as an equality by introducing a “dummy” security. Thus 
‘Uh above inequality can be represented by an equality if we let 
04K, + 02%, + .03X, — X, = .03, 
Where X20; 44 = 0; 04 = Ofory = 1,2,3,4, Aninequality such as 
Ms5 
19) be represented by an equality 
X+%y=5, 


where %, is another dummy security with X, 20; 4s = 0; og = 0 for 
Wi) If there are, for example, three securities (other than dummies) 
Prehased subject to constraints: 


i) X, 20, J=1,2,3, 
H 

i) YX=1, 
re 

0 04%, + 02%, + .03%5 > 03, 

“ Brcaty 2123, 


‘tan express the system as inequalities X; > 0 plus a set of equalities 
‘by letting 


M+ Mt % =1, 
04X, + .02%, + .03X, — ¥, = .03, 
% +X 
X +X 
a +X = 5 


X,20 for j=1,2,3,4,5,6,7, 
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where X,, Xs, Xq, and X, are dummy securities with mean and vari 


equal to zero. 6 my he 
A system of m equations can be written as aM bees 
AX =, 
where A is an m by n matrix, X is as previously defined, and 6 is an : 
element column vector. Wecan also represent X,>0 for j = 1,- - -, NI ne ay Oa ay 
X20. 


The expected return on a portfolio is 


E=SXity 
where m, is the expected return on the jth security. (4, = 0 for dum 
securities.) We shall let « be the column vector of 4. Its transpose 


HE (ys Hay * 4 Ha) 
The expected return on the portfolio is the inner product 


That is, 
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‘The following new matrices are used in the computation: 
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E=yp'X, Where 4’ is the transpose of A.M thus is an (m + 7) by (m + n) matrix, 
Tie variance of return onthe portfolio) ‘Wade out of C, A, A’ and an m by m zero matrix. M is symmetric. 
V = Id XX, iy 
may also be written * : 
x 
peas a8 lkye o| fo 
Ve(X Xp Ke: alee p ig 4 -[ 
Gay Ture 2 Dns 


Xe ‘ 
(Confirm this.) | 
If we let X’ be the row vector (Xy,- ~~, Xq] and C be the mat 
covariances, then we can write 


= ¥Cx. iis last m elements are those of the vector b. 
Since oy = 044 Cis symmetric, a 
7 
NOTATION 
The inputs to the efficient portfolio analysis, we have seen, are Mele ite 
C, the m by n matrix of covariances; Se (; } 


Hs the (column) vector of expected returns; 
A, an m by m matrix; 
5, an m element vector. 


A and 6 form the constraint equations AX = b. Lo 


‘Ais an (m + n) clement column vector. Its first n elements are zeros; 
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al] =R+ Sky. 


AV \s the matrix M except that the rows and columns corresponding to 


S is an (m+ n) clement column vector. Its first n clements are those | 
the vector j1; its last m elements are zeros, 

We define a unit cross to be an intersecting column and row vector Wi 
zeros everywhere except for a | at their intersection; for example, 


on ; Aa Wilibles which are out are replaced by unit crosses. For example, if 
0 001000 © 4, if-variables | and 3 are in and 2and 4 are out, then #7 is M with the 
0 0 0 ‘Jil and 4th rows and columns replaced by unit crosses. Our computing 
0 one vO iocedure is such that Mf is always non-singular (has an inverse). The 


yallubles X; in (1) are, as defined before, the amounts allocated to various 
Weurities. We have not yet met the variables 4,,** °, Aq. There is one 
Jy tach constraint equation, Thus, when 


rh = 


the only constraint equation, m = 1 and 


Suppose that 


Then to “replace” the second row and column by # “unit cross” means a 

change M into iv ‘ 
io) inet A a x, 
Oy 0 Oy a a, 
a, 0 4 0 


The role which the 4 play in our computation will emerge subsequently. 
He \s the (m + n) component column vector defined previously. S is the 
‘Slumn vector S, except with zeros corresponding to variables which are 
Wl For example, ifn = 4, m = 1, andj = 2 and 4 are out, then 


Critica Lines 


The set of efficient portfolios is made up of segments from critical li 
Associated with any critical line is a set of variables which are referred 


as in. Those which are not in are referred to as out. The formula fe in 
critical line is the following: n) 
Be Ma 
- 0 
0 
xX, 
w ia} = Rp Sigs |, is a number (rather than a vector or matrix). If'a particular value of 
A |) \s specified, equations (1) can be solved for X. 
‘ The formula for the critical line may be written 
ru a) ()] = (MR + (MY Shy. 
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M~ has @ unit cross in every row and column which is out. 
computationally more convenient to write M-! with a zero cross inst 
a unit cross for rows and columns that are out. (A zero crossis like a 
cross except that it has zeros everywhere, including at the inte 
We therefore define N(i) to be Mf-! (for the ith critical line we arrive 
tracing out the efficient set) except with zero crosses in out positi 
With this notation (2) may be written 


x 
(3) ("] = MAR + Ni) Say. 
Note that S may now be used instead of 5. 


THE COMPUTING PROCEDURE 


It will be convenient to illustrate the general instructions by a parti 


example. We shall use the following portfolio selection problem for 


purpose: 
n= 3. The covariance matrix is 
0146 0187 .0145) 
C=|.0187 .0854 .0104}. 
0145 0104 .0289. 


The expected returns are 


p= |.146]. 


The only constraint equation is 
Xt t+ %yak 

Of course, X, >0 for j = 1,2,3. 

From C and p follow 
0146 0187 0145 1 
0187 .0854 0104 1 
0145 .0104 0289 
1 1 1 


SS 
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0 


) 
R= 
0 
1 0 


l¢p | finds the efficient portfolio X" with maximum E. This is easy 
the only constraint equation is 


Sra! 


portfolio with maximum E consists entirely of the security with the 
In the example presented above the portfolio which 


xO e/a]. 


i a more general set of equations AX = 6 constrains the choice of 
WWlolio, it may be necessary to use linear programming to find the port- 
lo which maximizes E! (Linear programming is discussed in a subse- 
IM Section.) 

More than one portfolio may maximize E. When 


yr=1 


the only equality, for example, more than one security might have the 
‘me (maximum) 4,. One way to handle this is to alter the ys, slightly 
f). change a .125 to .1251) to obtain a unique maximum. Another 
Hethod, which does not require alteration of the problem, is presented in 
{ii} In the present discussion we shall assume that the maximum is 
whique. 

‘Siep 2 finds the formula for the critical line associated with X", We 
livsirate the procedure with our example before describing it for the 
‘prneral case. 


‘The present discussion assumes that there exists a portfolio which maximizes E, | 
TW could fail to be so only if | 
(0) no X satisied AX = 6, 
X20, oF 
(b) there was no upperbound to the obtainable E. 
fiher condition would be revealed by the linear programming computations, In 
al ia) the computation is over. In case (b) the efficient set must be traced by starting, | 
X rather than from 2. 


j= andj =3 are out. The equation 


of] =R+ Sy 
a 


1 o 0d 
0 0854 0 1 
OP Op erly 10) 
Oe ete 2 07101 


The inverse of a matrix with unit crosses can be obtained by 


(a) eliminating from the matrix the rows and columns with unit cros 
(b) inverting the matrix which remains, and 
(©) reinserting unit crosses in the inverse in the same position as in 


original matrix. 


Thus to invert Mf above we first invert 


0854 
1 


This is done with dispatch since, in general, 


ie) 


as ie i | 
lence i 
0854 17-1 1 A 
er —.0854. Shere C is an m by m submatrix of C; Aisa non-singular m by m sub- 
and Watrix of 4; A’ is the transpose of 4; 0 is an m by m matrix of zeros. 
Hy Onn) 0 pee * 
TON eg Pellet fo) 
Ton ROME ae Hi 40 (4 -ay cao 
0 1 0 —08s4 Mi) is obtained by introducing zero crosses into the above inverse for 
Substituting zero crosses for the unit crosses of variables which are ae and columns of the out variables, The formula for the first critical 
we get ne is 
OER Be (}] = MI): R+ AgMS. 
01010) 0%) a 
bia aot at oy ae Letting 
0 1 0 —08s4 TU) = KR, 
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; 
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The values of ¥ and A along the critical line are given by 


ve 
[7] = MIR + ApN(1)S; 


. i xy 0 0 
0 146 

EEA ional TAPE IE sally 
M4 x Pela ule 
: a} [0854 146. 


the present critical line only 4, changes as 4, changes. The X, remain 
Want. This is always true of the first critical line of the computation. 
Jiean also happen, but rarely does, in subsequent steps of the computation. 
Wii happens, for example, at point a in Figure 12 of Chapter VII.) It 
r es convenient to handle such “degenerate” critical lines in the same 
jiner as we handle the more usual critical lines. 

A similar Step 2 procedure is followed when the portfolio is subject to 
} More general system of linear constraints, AX = 6. If there are m 
‘Pitraint equations, then m variables will be in for the portfolio which 
Witimizes E. In linear programming parlance these are the “basis 
Wrlables” of the optimal solution. If we eliminate all the out rows and 
‘Wlumns from M, we have a matrix 


Ul) = MDS, 
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then the first critical line is Vii first segment of the efficient set lies along the critical line which 


Witisects /, at the highest value of Ay: in the present example the first 
‘Hpent lies along /,. 

Miep 3 in the general case may be described as follows. Let v, be the 
fomponent of the vector 


x 
[7] = Tl) + Ul) dg. 


On this first critical line only the 4, not the X, changes as Ag changes, 
Step 3 finds the values of Ap at which our first critical line intersects 
critical line with the following three properties: 


V=M-T7(I). 
141 w, be the jth component of the vector 
W = MUI) -S, 


(a) all variables are in which were in on the first critical line, 
(b) one additional variable is also in, and 


(6) all other variables are out. ® , = 0 forj which are in. 212 = —vjfw, is the value of dy at which 


inst critical line intersects the one for which the jth variable is also in. 
if W, = 0 there is no intersection.) The next critical line to be efficient 
H ihe one which intersects at the highest! Ap. (If this largest Ag is zero or 
Myiilive, the problem is over and X") gives minimum V as well as 


In the language of Chapter VII, the first critical line of our example 
‘We must now find the values of Ay at the intersection of /, and /,y 
at the intersection of /, and /p.. At the intersection of /, and |. we 


OX + ake + 03% +A, = my 


as well as eum E.) : 
Y ‘Mep 4 sets up the formula for the new critical line. In the example the 
[ | = Til) + Ag? UC) fe M differs from the old Mf in that j= 3 “goes in.” N(1) must be 
4 Wilified accordingly to give us (2). A general procedure for doing this 
Thus 1 the following: 
‘ (215 12 O13 INTO) + AgU()} = Hyde; Hvppose, in general, that the variable j, goes in for the (i + 1) critical 
that is, Let C,, be the jth column of M. Let 
) 
B= MiCy 
[.0146, .0187, .0145, 1) b= 8, 
—.0854 OSS 


Where my, is the element in the jgth row, jgth column of M, The elements 
i) of N(i + 1) can be expressed in terms of the elements of f,, of N(i) as 


0 
+ [.0146,.0187,.0145, 1} | ae = 062 Hows: 


1 


146 


Sida i 
Thus, at the intersection, 
~.0667 + .146Ag = .062Ap, 


Ap = 794. 


4 ey 
85, = Bie = — fori joy 


Where 6, is the ith component of B. 
Similarly at the intersection of |, and /. we have 


b i . , - 
[21s G0m G50, ITU) + Ula} = ade’ um iy Oh for! # fy and J # fy 
that is, 
—.0750 + 1461p = 1281p, 


"Ties may be broken arbitrarily. There is a small danger of cycling (eventually 
dg = 4.17. 


‘frining back to the same critical line a second time) with this procedure. In this event 
wei. 


eS 
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Thus, changing N(1) into N(2) for our example: 


000 0 0145 0 
FR ed 0104 i 
ONO) 0h 0 0209) lop lbue Oia 
0 1 0 —.0854}[ 1.0000} — | —.0750, 
14s 
0104 
6 = (0,1, 0, —.0750] = —.0646, 
0289 
1.0000 
¢ = 0289 + .0646 = 0935, 
fo 0 0 0 
‘4 1 1 _ 9750 
0935 (0935 0935 
ney = 10 1 1 0750 
0935 0935 0935 
—.0854 
9 1 0750 0750 + (=.0750" 
05350935 0935, 
ro 0 0 0 
© — 10.6951871 —10.6951871 —_.19786096 
*}o —10.6951871 —10.6951871 —.80213904)" 
[0 19786096 80213904 —.02523957. 


If N(2)and Mare multiplied, the product has unit crosses correspondi 


in variables and a zero cross corresponding to the out variable (j 
The formula for our new critical line is 


- 
("] = NQ)R + NQ)Shy: 


that is, 
%, 0) 0 
ai Metal 193) 
ve, 802 =.193 
A = .025 132 
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When Ay equals the value established in Step 3, 


Ap = 4.17; 
tn 

X, = 1.00 
a 

X,= 00. 


Ay |p is reduced, X; falls and X, increases. The points generated by (1) 

‘shove are efficient for Ay from 4.17 down to the value of 2, at which the 

sent critical line intersects another (or until 4, = 0, whichever happens 
Wi), 

Sep 5 finds the critical line which is first! intersected by our present 
Hilieal line as Ay falls. In the notation of Chapter VII, our present 
‘Wilieal line is /,5. We need only consider its intersection with /, and /, +5. 
{We need not consider /,, from which we just came.]  /p intersects /, when 

X= 198 + 193A = 0, 
Hat is, when 
Je <0 
Ay intersects 4... when 


{2415 G12 Oy, W{T(2) + U2 4} = fader 


where 
TQ) = NQ2)*R 
nd 
U(2) = M(2)S. 
Al the intersection, then, 
=.0099 + 131A = 0624p. 
14. 


Thus the first critical line to be intersected, as we reduce 4», from 4.17, is 
{4 intersected at 


The portfolio at which the efficient set turns from / to 2,3 has 


oC 
a] = (2) + 19UQ2). 


Specifically 
x 0 Onan (Ao 
X,| =| .198] 4.14 | .193| =| .22 
ve, 802 —.193]  L.78 


! See footnote on page 181, 
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Step 6 is like Step 4. We now set up the formula for the critical | 
‘29. As described there, we derive the new (3), which turns out 


111.2347 —27.00556  —84.22914 1.102335 
—27.00556 17.25160 9.75396 —.069764 
—84,22914 9.75396 74.47518 — —.032572 

1.102335 —.069764 — —.032572 —.014315 


NQ) = 


From this we calculate that /,.3 is 


x 
[i] = (3) + AUG) = NOR + Ag N(3)S, 
that is, 


25) 1.10 1.83 
iG 2.09 
= A ei 
28 Mea een) 
a 05 


‘Step 7 is like Step 5. As Ag moves down from .14, we find that 
intersects /,.5 when 


Ny = -.07 + 2.09 = 


that is, when A = .033, |... intersects / » when 


X= —.033 + 5.73Ag = 0, 


that is, when Ap = .006. Thus the first intersection is at A\} = .033 
the next segment of the efficient set lies along /, 5. 
The portfolio at which the efficient set turns from /, 930 hg is 


x 110 7.83] [84 
¥,| =| -.07| + .033] 2.09] = | 00]. 
XJ | =.03 5.73] 16. 


Step 8, like Steps 2, 4, and 6, produces the formula for the new 
line. This time, to obtain A(4), we must replace a row and col 
(namely, the second) by a unit cross. Whenever a zero cross repl 
row and column (/,), the g,, of the new N(i + 1) can be obtained fro) 
of the old N(i) by the following formula: 


=, — fhe. 
Bu = Ss 1 
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this we derive 
68.96034 0 —68.96034  .99313 
0 0 C) 
—68.96034 0 — 68.96034 —.00687]" 
99313 0 .00687 —.01460. 


formula for the new critical line is 


(}] = Ty + AU): 


N(4) = 


08 9931 4,551 

% 0 00 
t= +45 

| | 0069 $4552 

a} [-o146 06245 


Ay moves from .033 downward, it reaches Ag = 0 before it intersects 
ther critical line. Hence the efficient set ends at 


x 
(}] = T(4) + 0 U(4) = M14), 


\fically, the portfolio which minimizes V is 


x) 7.9931 
X,| = |.00 
x] — L.ovs9. 


ince the set of efficient portfolios is piecewise linear, it may be described | 
ils starting point, the points at which it turns, and itsend point. These | 
Nave found for our particular example to be 


begin Isttum 2ndturn —end 


Xe 00 84 # 
Rall it 2 00 cc 
cat) 8 16 ol 


‘My linear interpolation of adjacent pairs of such “corner” portfolios is 
Wlent. In this manner a portfolio associated with any efficient E, V 
PMbination can be constructed, as was illustrated in Chapter II. 
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The set of efficient £, V (or £, a) combinations can be sketcht 
evaluating and plotting the combinations associated with various ¢ 
portfolios. Or a method such as that in [11] can be used to ol 
formule describing the curve. 


Perimetric programming can give us the optimum portfolios for all 
Wellies of £°. 

} Rather than maximize minimum return (j.c., minimize maximum 
Jos) we could formulate a linear programming problem to minimize 
Setiage loss or average absolute deviation from E. 

The simplex method of linear programming and the critical line method 
a programming are su! tly similar that a machine program 

ihe former can be easily modified to serve as a machine program of 
Wy latter. This is shown in Appendix B. 


LINEAR PROGRAMMING 


A linear programming computation seeks the maximum (or mini 
of a linear function of n variables, 


‘Other computing techniques may also prove valuable in the analysis of 


x XC, ‘ 
im Prifolios. These include 
subject to m linear constraints, (a) Dynamic programming [17]; see Chapter XII; 
ay, {b) Monte Carlo techniques [18]; see Chapter IX; and 


{e) Gradient methods [20] (for maximizing a linear or a “well-behaved” 
PeHlinear function subject to linear and/or “well-behaved” non-linear 
‘Syjulraints. In the purely linear case the simplex method is superior). 
(i) Our £, ¥ efficient set computation is an example of quadratic 
Popramming, [11], {12}, and [13] discuss methods of quadratic program- 
ring 


and subject to 


G. Dantzig’s “simplex method” is a general and efficient comp 
technique for solving such problems. Descriptions of two variam 
this method are to be found in [14] and [15]. 

Techniques have also been developed to find all solutions of the resil 
linear programming problems as the b, vary according to a linear rul 


by = Oo + 1b}. 


This is a case of “parametric” linear programming (16). 

‘A number of portfolio analysis problems can be reduced to eill 
linear programming or a parametric linear programming com 
problem: 


1. The problem of maximizing expected return subject to linear 
straints, as at the beginning of an E, V efficient set calculation, is a 
programming problem. 

2. Suppose security ihad return r,,at time ¢. The return on a por 
M+ ++, X, at time tis 


Re= Drake 
By suitably introducing dummy X,, the problem of finding the poi 
whose smallest (i.e., most negative) R, is as large (i.c., least negati 


possible can be formulated as a linear programming problem. We 
also require 


Diu; = 


THE SEMI-VARIANCE 
definition, 


= (« =) if (* — b)is less than or equal to zero; 
poe if (r — b) is greater than zero. 
Por example, with b = .2, 


(Carrer IX ‘ 
if r equals: then (r — .2)- equals 
THE SEMI-VARIANCE ol aol 
45 0 
-4 = 6 
0 -2 
INTRODUCTION ei a4 


In this chapter we consider the semi-variance as a measure of 
First, we define the semi-variance; next we compare the semi-vati 
with the variance, noting similarities and differences and pros and 
cach. After this we consider a geometric analysis and computing 
dures for the derivation of efficient portfolios based on expected 
and semi-variance. 


ie the mean value of {(r — 5)-]*. If r takes on the values of the above 
ple, then 
=) (= 6) + (2) 4.(—3)) 
s,=6 De l= GY asad Deke) Sr eg 
5 5 
If) = 0, 5, is the same as 5, defined previously. 
‘Jy is the mean value of [(r — £)-}*, where £ is the mean value of r. 


Derinitions J) the example above, E = 02; hence 
By definition, (=.42?'+ (~.12)? _.1908 
__ fr ifr is equal to or less than zero, SP Sas orars ens ee 
“0. ifr is greater than zero. 
For example, Bikey 


ifr equals, then r= equals: i 
a 0 The distributions in Figures 1A and 1B illustrate the notion of a 
5 0 iy about £. The two asterisks labeled (2), .075 unit below E in 
. i. 1A, correspond to the two asterisks labeled (a’), .075 unit above E 
aa te (pure 1B. The asterisks labeled (6), (c), and (d) in Figure 1A similarly 
0 0 ‘PHiespond to asterisks (b'), (c’), and (d’) in Figure 1B. 


We could reflect a distribution about a point other than £, The 
‘Miribution in Figure 1C, for example, results if we reflect the distribution 
‘) Pipure 1A about the pointr = 0. When we reflect a distribution about 
pected return, the resulting distribution has the same expected return. 
‘When we reflect a distribution about any other point, the resulting distri- 
Wilon has a different expected return. If we wish to confine our attention 
} distributions which all have the same expected return, we must restrict 
‘Wihelves to reflections about E. 
Henceforth, when we speak of the reflection of a distribution we mean 
WW fellection about E. 


= =| 

Sq is defined to be the mean value of (r-)*._ If ris a random variable 

future event subject to probability beliefs, then 
Sq = expt(r-). 


Tf ris the past return ona portfolio, Sp is the average (r-)*. If r takes 
the values .1, .5, —.4, 0, and —.1, as in the above example, 
n 


(4? + ( 


S= 


7 
= — = .034. 
5 
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A distribution is symmetric if reflecting it about £ reproduces ¢ 
the same distribution, The distribution in Figure 2 is symmetric; # 
in Figure 1 are not 


We left. V/2Sy may be used as a measure of skewness. For symmetric 
‘Wiributions ¥/2Sg equals 1. If extremes to the Tight outweigh those to 
Whe left, ¥/2S, is greater than 1; in the opposite case it is less than 1. 

When all distributions are symmetric, or have the same degree of 


i ‘WHvness, the efficient portfolios produced using Sp are the same as those 
shores (8) | (a) Wing Y as the measure of variability. If the degree of skewness varies 
ae a eR) analyses based on Sy and V may differ in their 
TP SE a am Boe 8. Among portfolios with the same expected 
ou Miuin and variance, an analysis based on Sy chooses the portfolio with 
{ Hii preatest skewness to the right, or the least skewness to the left in its 
¢) e)o) @ Miibution. 
[ol ae Eee | peso ae 
-» -5 -0 —-m 0 a 10 is 20 WWLANTIES AND DIFFERENCES 
(B) 
: It us consider changes in distributions, particularly changes which 
dt a f) the mean of the distribution the same. Let us compare the effects 
() 2 Which such changes have on variance with those they have on semi- 
(Cuetec. 1 aif | eT ae Wlance, Such a comparison will illuminate the objectives of an analysis 
-0 =-15 10 =.05 o 05 10 4s a 


Phich minimizes S for given E, as contrasted with an analysis which 
Hiimizes V for given E. 
Distributions with the same mean may be thought of as being related by 
lies of changes, each change involving only two points.’ The distri- 
Hin in Figure 3D, for example, may be derived from that in Figure 3A 
first changing the points (a) and (6) in 3A to (a’) and (b') in 3B; then 
Wiping (c) and (d) in 3B to (c’) and (d’)in 3C; and finally changing (e) 
W(/) 10 (e') and (f’) in 3D. In each of these changes the two points 
We equal distances in opposite directions. This is necessary to keep 
WH wean E constant at each step. 
We shall refer to such movements of two points, equal in distance but 
Horite in direction, as binary changes. 
‘Muy binary change which moves two points closer decreases variance. 
Yl binary change which further separates two points increases variance. 
# true irrespective of the location of the points. Thus binary changes 
Which move together the Points (a) and (a') in Figure 2, or the points (a) 
i (b) or (a’) and (6) in the same figure, would all decrease variance. 
opposite binary changes, which further separate these points, would 
Wii inctease variance, 


© 


Figure |. Reflections. 


‘The variance of a distribution is always the same as the variance 
Teflection. The variance of the distribution in Figure 1A, for exam 
the same as the variance of the distribution in 1B. An analysis 
E and V would consider these two distributions equally desirable. 


tc) (c') 
a) E (o! 
: (b) (b') . 
i j 


L 
0 


Beas 


Figure 2, A symmetric distribution, 


The semi-variance of a distribution, on the other hand, is not alway 
same as that of its reflection, Sy, for example, is smaller for the 
bution in 1A than it is for that in 1B. S,, for a wide range of b, is. 
smaller in 1A than in 1B. 

A distribution which shows greater extremes to the right than to t 
of £, as in Figure 1A, is referred to as skewed to the right. A distri 
Which shows greater extremes to the left of E is referred to as ske 


“tiers, as clacwhere, we restrict ourselves to distributions represented by a fini 
of asterisks in a figure such as Figure 3. It is not necessary that both distri- 

i be represented by the same number of points, since a distribution represented 

*\ points and one represented by ky points can both be represented by ky times ky 
* 
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The amount by which variance changes does depend on the location 
the points. The effect on variance of a small change in the position 
Point is roughly proportional to the distance of the point from E. As 
change of a point at (a) or (a’) in Figure 2 has about five times the ¢ 
on variance as does an equal change in a point at (c) or (c’). A § 
binary change in (a) and (a’) has about five times the effect of an eq 
binary change in (c) and(c’), Since (a) and (a’) are on opposite sides of, 


Hiange does not affect Sp. Thus the movement of (a) and (b) together or 
the similar movement of (6) and (c’'), in Figure 2, would decrease Sy. 
‘The movement of (c’) and (a’) toward each other would have no effect on 
‘Jy The movement of points below E affects Sy exactly as such move- 
Wents affect V. 

Similar remarks apply to S, for & not necessarily equal to E, Binary 
Movements which bring together two points, one or both of which are 
Wow b, reduce S,. Binary changes in points which are and remain above 


(a) (b) . Phiave no effect on S,. In the case of movements of points to the left of b, 
t i i i i L 1 the effect on S, is exactly the same as the effect on V.! 
18-10 95 0 05 10 AS To the left of 6, S, evaluates changes (which keep E the same) exactly as 
Cy) F evaluates them. To the right of 5, changes do not affect S,. Out- 
fies greater than 6 enter an analysis based on S, only in so far as they 
; AG Affect £. 
HM tlk Hanes ribs cans hc, 
=15 10 = 05 0 05 10 15 
®) Phos ano Cons 
"e) Several conflicting considerations influence the choice of V or Sas the 
r) ata “) ie @) Measure of variability in a portfolio analysis. These considerations 
L L 1 L 1 n i iNelude cost, convenience, familiarity, and the desirability of the portfolios 
Bal Den hth OL eee oduced by the analysis. 
© Variance is superior with respect to cost, convenience, and familiarity. 
. Por example, roughly two to four timesas much computing time is required 
+) * (on a high speed electronic computer) to derive efficient sets based on Sy: 
t Tei f a: f i {han is required to derive efficient sets based on V. In an analysis based 
7A 10.05 t) 05 lo 15 00 ¥, only means, variances, and covariances must be supplied as inputs; 
© 


whereas an analysis based on S requires the entire joint distribution of 
felurns, Unlike semii-variance, variance and standard deviation are 


+ Rinaay eianges, tnown by many people acquainted with modern statistics, 


Vs, =E2G — b)F, where this sum is confined to r, <b. Since ,—b = 
t 


ty £) + (E-6), 


luces variance; the movement of (6) away from 
incteases variance. The reduction due to the movement of (a’) is greal 
than the increase due to the movement of (6'), because (a') is farther fr 
E than is (6'), 
A binary change which moves two points together decreases Sip as lo 
as one or both points are below £. If both points are above £, the bina 


J u 
Sm y Ele 2)+ E-D) 


= i (2 — EY + XE — dC) = KE) + K(E — 5), 
t 2 


Where X is the number of asterisks below 6. Since binary changes do not affect Sr, 
the effect on S, is the same as the effect on (1/7) (ry — E)¥, 
t 
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This superiority of variance with respect to cost, convenience, 
familiarity does not preclude the use of semi-variance. The comy 
cost involved even in the use of semi-variance is small compared with 
other research costs involved in supervising one or more large portf 
The requirement, by analyses based on variance, for covariances 
tween each pair of securities leads to models for derived covaria 
models which are sometimes equivalent to, sometimes little less 
models of the entire joint distribution required by an analysis 
on S. Familiarity, finally, is a transient thing: use can make 
familiar as V. 

Analyses based on S tend to produce better portfolios than those 
on V. Variance considers extremely high and extremely low 1 
equally undesirable. An analysis based on V seeks to eliminate 
extremes. An analysis based on Sj, on the other hand, concentra 
reducing losses. 

Efficient portfolios based on variance, however, cannot be characte! 
as bad or undesirable. If all distributions of returns are symmetri 
have the same degree of asymmetry, V and S, produce the same 
efficient portfolios. Since a portfolio with very low variance avoids 
extremes, it must have a low Sg as well. The only complaint one 
raise about such a portfolio is that it sacrifices too much expected 
in eliminating both extremes. 

Clearly there are pros and cons on each side. The proper procedu 
‘seems to me, is to start with analyses based on variance. Analyses 
on semi-variance, and those based on still other criteria, such as 
specially constructed utility functions discussed in Part IV, can be 
sidered after experience is gained with simpler measures, In the 
of criteria, as well as in other respects, the form of analysis must 
expected to evolve. 


GEomETRIC ANALYSIS 


This section will present a geometric analysis of E, S, efficient porti 
based on the historic returns of the three securities in Table 1. We 
assume the standard constraints, 


420, f= 1,23, 


ytnty 


Modifications of this analysis required by the use of S, or Sy rather 
Sy are noted. 
Suppose that a particular portfolio (X,, X,, Xs) incurred a loss ( 
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Taste 1 
RETURNS ON THREE SECURITIESt 


Security 
Year 
! 2 3 
AT. &T. USS. Ban. 

1937 173 38 =319 
1938 098 285 016 
1939 200 —.047 381 
1940 030 104 —.051 
1941 — 183 17h 087 
1942 067 —.039 262 
1943 300 149) 34l 
1944 103 260 207 
1945, 216 419 382 
1946 — 046 —.078 133 
1947 -.071 169 —.099 
1948 056 —.035 038 
1949 038 133 23 
1950 089 732 091 
1951 A 021 054 
1952 083 ABT 109 
1953 035 006 210 
1954 116 908 M2 


{ Securities 1, 2, and 3 of this table are 2, 3, and 7, respectively, of Table 1, 
Phupter 1. 


Li. <0) in years .° + +, 1, and made a profit in years ticyas * bye 
/ example, the portfolio (1, 0, 0) has 
(t+ + tx) = (1937, 1941, 1946, 1947), 
(tacsny "> sty) = (1938, 1939, 1940, 1942-45, 1948-54). 
In this case K = 4. 5S, for such a portfolio is, by definition, 
eer 
22%? 
=fit ' 
ip 


Where r,,, is the return on the ith security in year f,. Sp is also equal to 


So 
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a) Se= Dstt XM, ‘Mioviated with any profitability polygon is a critical line similar to that 

where vs W the three-security analysis based on E and V. The critical line for the 

1k ljpon is the locus of points which give minimum So(t,~ ~~, 4) for 

@) Sully 4) = po fale i £, ignoring the inequalities X, > O (where Sy(t,,* ~ +; 4) is defined 
a te 


(Note the similarity between the definition of s,, and that of 9,,; also 
similarity between (1) and the formula 


V = Yoo XiXp] 

If portfolios P and Q have the same unprofitable years (t,, «+ 

Sisllty * * +) G)) used in computing 5p by (1) are the same. 
For any year f, the portfolios which satisfy 


Estes 5 XX). 
7 


Prcept in certain degenerate cases ignored here, the critical line is, in fact, 
F Mraight line, 


Dr <0 
are unprofitable. Those which satisfy 
DXi, > 0 
are profitable. For example, for = 1948 the unprofitable portfe 


satisfied 
056X, — 035%, + 038%, < 0; 


or, substituting Y= 1— ¥, — ¥,, unprofitable portfolios in 1948 
018%, — .073.X, + 038 < 0. 


The line marked 1948 in Figure 4 is the locus of all portfolios with 
Profit in 1948. The arrow attached {o this line indicates the portfol 
which were. unprofitable in that year. Thus any portfolio above or to 
left of the 1948 line was unprofitable in that year. Lines and arrows | 
Figure 4 similarly indicate the unprofitable regions for other years. 
portfolios below the 1947 line were unprofitable then; portfolios al 
the 1941 line were unprofitable then. Some years do not appear el 
because all portfolios were unprofitable (1937) or all portfolios 
profitable (1938, 1943-45, 1949-54). 

The “profitability lines” divide the set of obtainable portfolios i 
“profitability polygons.” The polygon labeled 41, 46 consists of 
portfolios which were unprofitable in 1941, 1946, and 1937. 
labeled 41, 42, 46, 48 contains all portfolios which were unprofitable 
1941, 1942, 1946, 1948 as well as 1937. 

Portfolios within the same profitability polygon have the sal 
Sifty,* + yl). Thus for all portfolios in the polygon labeled 41 we hi 


Figure 4. Profitability lines and polygons. 


The portfolio (0, 1, 0) is efficient since it has maximum E. This point 
inthe polygon 39, 41, 42, 46, 48. Its critical line is computed, therefore, 
With 5:,(37, 39, 41, 42, 46, 48). If we compute this critical line, we find 
hat it passes to the left of the point (0, 1,0). Hence the E, S efficient set 
Marts by moving down the left side of the set of obtainable portfolios. As 
F decreases, the efficient set intersects the 41, 42, 46, 48 polygon before the 
Wilical line of 39, 41, 42, 46, 48 intersects the set of obtainable portfolios. 


5,(1937, 1941) = z {(—.173)8 + (—.183)2} = .00352. 
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ombinations, though finite, is extremely large, are presented in the 
} Section. 

‘The present section will draw on the notation and results of Chapter VIII. 
1 an E, S analysis the critical line is determined by 


Hence the efficient set moves down along the X, axis (Figure 4) to 
polygon 41, 42, 46, 48 without turning. Similarly the critical line of 
polygon 41, 42, 46, 48 does not intersect the set of obtainable por 
before the efficient set reaches 41, 46, 48. The story is repeated for 
gons 41, 46, 48 and 41, 48. The critical line associated with 41, on 
other hand, intersects the attainable set before the efficient set inter: 
new polygon, Hence the efficient set (shown by a heavy line in Figu 
moves along the critical line of the 41 polygon. This critical line 
intersects the 41, 47 polygon. The efficient set then moves along 
critical line of the 41, 47 polygon, as indicated in the figure. The 
line of 41, 47 intersects the X, axis before intersecting any new pol 
The efficient set then moves along the X, axis until it reaches the 
with minimum 5p. 

5, efficient sets for 6 #0 may be similarly derived. Polygons 
defined with boundaries 


1h) the set of X, which are in as opposed to those which are out (as in 
£, V analysis), and 
1)) the set of years 1,,* * -, t4¢ which are unprofitable. 


ritical line associated with a set of “in” variables is the same as in the 
| V case except that a, is replaced by 


Testu te) = 3 Con Blt 8) 


y decreases, the critical line may 


{\) first! intersect another critical line defined in terms of the 
J sh). This is recognized and handled exactly as in the E, V 
lysis; or 

1) first intersect a profitability line: 


DXru 
A, at which each such intersection takes place is computed from 


(7, OMMIDR + NSAz} = b, 


Dek = 5 


the sifu,” °°) fx) are defined as 


1e 
Tels — Nr, — ). 


b 


The efficient set is traced out as in the E, Sy analysis. 
When E and Sy, are used as criteria of efficiency, the boundaries 


polygons are 


TMG. - a) = 0. We 


(75,0) = (in + sty 0,” * *4 0). 


save (ii) the 5,, must be modified and a new M(i) produced. Case (ii) 
two subcases: 


The 5,,(t),* + *, lx) is defined as 


ex 
Fm Mtg — HOt, = Hs 
cat 4) year (iq) goes from the profitable to the unprofitable set; and 


The efficient set is traced out as above. 1) & year (io) goes from the unprofitable to the profitable set. 


1) ease (a) the new §,; is obtained from the old s,, by 
Teh = Ty + Palsy 


WI = Silo te ds Sig = Sul oy tg) and Fame — 

1h case (b) we obtain the new 4,, from the old s,, by T+ 5, = 7 5,5 
~ Jf, With the variables defined as above. 

1) nse (a) the M matrix becomes 


COMPUTING PROCEDURES 


We shall now discuss computing procedures for deriving efficient 
based on E and S,. Slight modifications of these procedures, as indi 
at the end of the last section, make them applicable when Sy, is used 
measure of risk. The procedures of the present section permit 
number of securities (N) and any number of constraint equations 
The procedures assume, however, as in the last section, that there 
finite number of “years.” Procedures applicable when there 
infinite number of possible combinations of returns r, or when the ni 


Mnew = gold 4 (‘) (0), 


| Tit are broken arbitrarily. Anti-cycling techniques can de built if needed. 
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where 


0 = Guys 


‘Questions concerning the proper manner of selecting a sample of returns, 
the accuracy afforded by such a sample, are in the realm of Monte 
lo analysis. The reader is referred to [18] for a discussion of this mode 
alysis beyond the few remarks to be made here. 

Correlated sampling is a Monte Carlo technique for increasing the 
wwney of comparisons. Suppose, by way of illustration, that portfolio 
‘id portfolio 2 have the same expected return. Let S, and S, be the 
Values of Sz for the two portfolios. Let S, be the estimate of 5, 
Wyided by a sample of combinations of returns; let S, be the estimate of 
We wish to estimate whether 5, > 5,; that is, we wish to estimate 
her S, — 5, is positive or negative, The variance of S, — S, equals 


var(S,) + var(S,) — 2cov(S,, S)- 


Th sccuracy of the estimate of 5, — S, is improved if S, and S, are 
Poillively correlated. Var(S, — S,) decreases as cov(S,, S;) increases. 

Correlation between S, and S; is automatically obtained, as a rule, when 
jame combinations of returns are used to evaluate both portfolios. If 
ticular sample has a disproportionately large number of bad years, it 
fend to overestimate both 5, and S,; and conversely if the sample 
ins a disproportionately large number of good years. Thus the 
pnw of basing estimates of S for all portfolios on the same sample 


In case (b) the M matrix becomes 


Mew = yfold — (é) (7,0). 


The new M(i) can be obtained by the general rule that, if 


B=A+w', 


where B and A are matrices, ua column vector, and v’ a row vector, 


(Am uyv'A-) 


Bla gt 
1+ 0'A“u 


With the new (i), the computation proceeds as before. 


Monte Carto PROGRAMMING OF E, 5 EFFICIENT PORTFOLIOS 


The computing procedures presented in the last section apply when 

are a small number of possible combinations of returns. An al 
based on past annual returns, for example, has only one combination 
each year. An analysis based on probability beliefs—as expressed, 
example, in the models for derived covariances discussed in Chapter 
can admit extremely large numbers of combinations of returns. It 
be impractical to apply the methods of the last section to all such 
combinations. 
Approximate, but fairly accurate, results can be obtained by a 
the techniques of the last section to a random sample of possible com! 
tions of r. For an analysis involving 100 securities, 2 sample of 
combinations seems to be a reasonable compromise between the needs 
curacy and the burdens of computation.’ The portfolio which 
mizes S based on such a sample will have a true 5 almost as small ay 
of the portfolio which minimizes true S. 


More accurate comparisons than if independent samples were drawn 

ih portfolio, 
Avother Monte Carlo technique for increasing accuracy is that of 
Pyertance sampling. This technique directs that combination of returns 
Wl extreme effects on S should be sampled more heavily than other 
binations. The sample must be weighted afterward to remove the 
in selection. if importance sampling is used in selecting combinations 

lurns for an E, S analysis, the objective of reducing computing require- 
ils, as well as that of increasing accuracy, must be kept in mind. 
ability lines (planes) which do not intersect the set of legitimate 
Molios add little to computing time. Thus extremely bad years in 
eh all portfolios have return less than £ (or ) should be sampled most 
Wvily, since they give rise to both large effects on S and small require- 

* It is desirable for the sample of combinations of returns to be larger than the n a ncn techniques and techniques of mathematical 
of securities contained in the analysis, This would apply if we sampled from a t Jbgramming are thought of as competitive: the former is used when the 


10 obtain covariances for an analysis based on V, as well as if we sample combina a 
for an analysis based on S. If an analysis of 100 securities is based on 99 combi lublem is too complicated for the latter. In the present case, however, 


Of reiurns, there is at least one (legitimate or non-legitimate) portfolio with zero vat ily (Wo techniques are complementary. Both techniques of Monte Carlo 
We have here a “degrees of freedom" problem similar to that in multiple ‘Wl those of quadratic programming are required to provide portfolios 


‘An analysis bas i . sae fs 
Bee Lop sed a a random sample of 200 combinations of returns provides Hh S reasonably close to the minimum S attainable for given E. 


PART IV 


RATIONAL CHOICE 
UNDER UNCERTAINTY 


CHAPTER X 


THE EXPECTED UTILITY MAXIM 


‘Theory oF RATIONAL BEHAVIOR 


A portfolio analysis is characterized by 


{1) the information concerning securities upon which it is based; 

1) the criteria for better and worse portfolios which set the objectives 
the analysis; and 

1)) the computing procedures by which portfolios meeting the criteria in 
ive derived from the inputs in (1). 


results of a portfolio analysis are no more than the logical conse- 
Wees of its information concerning securities. Although they may not 
‘pparent to unaided reason, the results, nevertheless, are but restate- 
i's of these inputs. 
Moadly conceived, criteria include that for distinguishing legitimate 
Non-legitimate as well as that for distinguishing better from worse 
Wolios. The criteria determine the type of conclusions sought. It is 
iossible to derive all consequences of inputs. The criteria of an 
lysis is its guide to what is important and unimportant, relevant and 
evant. 
‘The computing procedures are the means by which the information 
|| securities is transformed into conclusions about portfolios. With- 
effective computing procedures, an analysis cannot be applied to real 
folio selection problems. 
Computing procedures are based on the relationships between securities 
‘Wi! portfolios, as well as on the theory of mathematical programming. 
$a 11 of this monograph dealt with relationships between securities and 
Wfolios. Part IIT dealt with principles and rules of computation. 
Pijendices A and B cover some aspects of computing procedures which 
ire more mathematical background than was assumed in Part III of 


hy Jext 


in large part, this monograph is a discussion of the relationships 
205 
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and procedures by which information about securities is transformed ‘Vie is no integrated theory by which we could dispense with human 
conclusions about portfolios. ‘Wings if we had a sufficiently large and fast computer. The study of 

Problems concerning the proper information to serve as the basic j ‘WWilonal behavior has produced only general principles to be kept in mind 
concerning securities are outside the scope of this monograph. There ‘# ides. Even the significance of some of these principles is subject to 
no magic formulas to supplant the sources of information and the rul WMiroversy. The value of the study of rational behavior is that it supplies 
judgment of the security analyst. The portfolio analyst requires that ») With a new viewpoint on problems of criteria—a viewpoint to be added 
final judgments of the security analyst be cast in a somewhat new fé 1) common sense to serve as a basis of good judgment. 

He cautions the security analyst against jumping too directly from 
properties of a security to its place in a large portfolio. Other than 
the problem of arriving at reasonable beliefs about individual secu 
lies, as always, in the realm of the security analyst. 

The remaining chapters discuss matters relevant to the choi 
criteria, Part 1V is based on studies of “rational behavior” in the 
risk and uncertainty. These studies seek principles of reasonable a 
when the future is not certain, They seek maxims to guide us in, ai 
other things, the choice of criteria for the analysis of portfolios. 
The theory of rational behavior is usually presented as a study of 
principles upon which a rational man would act. This Rational Mi 
unlike you or me in that he makes no errors in arithmetic or lo 
attempting to achieve his clearly defined objectives. He is like you 
me, on the other hand, in that he is neither omnipotent nor omnisch 
He must make decisions, such as the selection of a portfolio, in the fg 
uncertainty. Since his information is limited, he may take less 
perfect actions. Since his powers are limited, his achievements may | 
short of the best conceivable. Every action, however, is perfectly thor 
out; every risk is perfectly calculated. 

The Rational. Man, like the unicorn, does not exist. An attempt to 

general principles by which he would act, however, can be suggestive 
our own actions. 
Another interpretation can be given to the theory of rational beha\ 
Rather than visualizing a Rational Man we can visualize a Perfect Ci 
puting Machine with unlimited speed and capacity. This compu 
machine instantly processes all available information to determine a 
portfolio for us, The study of rational behavior may be viewed as 
enquity into the principles by which we would have this Pei 
Computing Machine proceed. 

The Perfect Computing Machine, like the unicorn and the rational U = (1/21) + (1/2K—1.3) = —15, 
does not exist. Limitations of speed and storage will always constitul 
practical as well as a theoretical restriction. Reflection on what we This is less than the utility of having zero return with certainty. ‘Thus the 
ask of such a machine, however, can provide a perspective on what Wlividual maximizing the expected value of utility, as represented by the 
Pnoutd eek oh our ceabmachined: ‘vive in Figure 1, would prefer the certainty of no return rather than a 
The theory of rational behavior is not a substitute for human judg: 


Pericreo Unity 


When carly mathematicians first formulated principles of behavior in 
‘Wace situations, they assumed that the proper objective of the individual 
#9) to maximize expected money return. It was later found that this 
‘hjective can be incredibly bad. 

Mather than reproduce the classic example! which first convinced some 
‘Hpheenth-century scholars that something was wrong with the expected 
{#lurn maxim, we shall consider an equally convincing argument from the 
Wild of portfolio selection. An investor who sought only to maximize the 
‘jected return would never prefer a diversified portfolio. If one security 
Nad greater expected return than any other, the investor would place all 
Wis funds in this security, If several securities had the same (greatest) 
‘Hipected return, the investor would be indifferent among portfolios, 
Wyersified or not, which contained only these securities. Thus, if we 
‘Smider diversification a sound principle of investment, we must reject 
the objective of simply maximizing expected return. 

The expected wiility rule was proposed as a substitute for the expected 
Wun rule, A return of 20%, it was argued, is not necessarily twice as 
* asa 10% return; a loss of 20% is not necessarily twice as bad as a 

sof 10%. Perhaps there is a curve, such as that in Figure 1, relating 
Wility to various levels of return. According to the curve in Figure 1, for 
‘ample, the utility of a zero return is zero, the utility of a 10% gains 1, 
Whe utility of a 10% loss is 1.3. Perhaps, instead of maximizing expected 
flurn, the rational man would maximize the expected value of the utility 
Of return, The expected utility of a bet with a 50-50 chance of a 10% loss 
40 n 50-50 chance of a 10% gain equals 


* See (32), page 93, on the St. Petersburg paradox. 


208 PORTFOLIO SELECTION THE EXPECTED UTILITY MAXIM 209 


50-50 chance of a 10% loss or gain. Expected utility is lower i 
second case, even though expected return is the same in both cases, 
The modern notion of utility avoids any hedonistic interpretation 
curve in Figure 1. We are not to think of the vertical axis as rep: 
pleasure and pain. It simply represents the degree to which the indi 
is willing to take risks for outcomes presented along the horizontal 
As to the philosophic, introspective, and psychological interpretati ively rejecting the expected utility maxim as well. 
this varying willingness to take risks—this, the modern school 1h recent years a justification has been presented that goes beyond the 
brings us from firm ground to tangential areas of controversy. ent plausibility of the expected utility maxim. The new axiomatic 
‘ouch begins with basic principles which seem beyond denial, then 
jonstrates that the expected utility maxim follows from these principles. 

Axiomatic approach has revived interest and gained a large number of 
lherents for the two hundred year old expected utility maxim. 
‘Some recent commentators, on the other hand, haye argued that the 
‘Ppected utility maxim is nor the essence of rational behavior, They show 
Witances in which human action differs from that dictated by the maxim. 
‘More pertinent, they show instances where reasonable action and the 
‘Fipeeted utility rule apparently contradict. These opponents of the 
ye claim that, while the axioms upon which the modern argument is 

itd have immediate appeal, they conceal objectionable assumptions. 

‘The adherents of the expected utility maxim argue, to the contrary, that 
We existence of conflicts between actual behavior and the maxim is 
Witlevant to the problem of rational behavior—the human is frequently 
Wiplused and contradictory, As for the apparent contradictions between 
Whe expected utility maxim and reasonable behavior, adherents claim that 
‘Hpponents have misunderstood and misapplied the maxim. 

‘The matter is still not settled. At least two well-known economists who 
Isl wrote as opponents later became adherents of the expected utility 
Wasim, The writer knows of no equally famous conversion in the other 
Wivetion. Such counting of ayes and nays, however, does not affect the 
WWlits of the case. Arguments for and against are presented later. The 
fraer may find the pro and con discussions in the literature; see, for 
‘sample, (25), (26), [27], [28], [29], and [30]. Since this writer is essentially 
§ proponent of the expected utility maxim, the reader may particularly 
‘We shall refer to this formulation as the expected utility maxim. wiih to read arguments by one or more of the opponents of the maxim. 

The expected utility maxim avoids the difficulties which condemned 
expected return maxim. If his utility curve is shaped like that in Fi 
with increasingly great returns adding less and less to utility, the i 
will generally prefer a diversified portfolio. If, over the range of 
returns, his utility (U) depends on return (r), according to a for 
the form 


nvestor will select one of the efficient portfolios of an analysis based on 
ypected return and variance. The particular portfolio preferred depends 
the value of the positive number A. 

‘The expected utility maxim appears reasonable offhand. But so did the 
ted return maxim when first used. The latter maxim, we saw, is 
le unacceptable. Perhaps there is some equally strong reason for 


Unity 


40 Return, % 


Figure |. A utility curve, 


The expected utility maxim, stripped of any hedonistic interpre! 
says that the individual should act as if 


(1) he attaches numbers, called their utility, to each possible out 
and 

(2) when faced with chance alternatives he selects the one will 
greatest expected value of utility. 


PxvecteD Unity AnD THE ANALYSIS OF PorTFOLIOS 


There is no inevitable connection between the validity of the expected 
Willty maxim and the validity of portfolio analyses based on, say, expected 
‘MWlrn and variance, It is logically possible to accept the expected utility 


2 
=r-Ar, : ; : ae 
u 4 Warim and either accept or reject the use of mean and variance as criteria 
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of portfolio selection. Conversely, itis logically possible to accept 
of mean and variance and either accept or reject the expected utility 

The expected utility maxim, nevertheless, will be central to our § 
quent discussions. There are two reasons for this: 


Lint OF PART IV 


The remainder of this chapter is devoted to the analysis of choices in 
le time period choice situations. This case avoids problems of decisions 

equentially through time. It assumes that first a decision is made; 
fonsequences of the decision occur next. There are no intervening 
ons in the single time period analysis. Chapter X! discusses rational 
lavior over time, Chapter XII discusses probability beliefs, a concept 

which much of this monograph is based. Chapter XIII, finally. 
Als applications of the theory of rational behavior to problems of 


1. This writer believes that the arguments in favor of the ex 
utility maxim are quite convincing, especially for its application in 
such as portfolio selection. The maxim has to be stretched, pe 
intolerably, to apply to the making of decisions in which surprise 
fun of gambling are important motivations. These, however, al 
important objectives for the direction of a machine in the allocal 
large amounts of other people's money. 

2. The expected utility maxim, if accepted, provides a basis for di 
sing questions of criteria in the analysis of portfolios. For example, 
know that an individual 


We shall imagine an individual who must choose between alternative 
(a) acts on the basis of expected utility; ability distributions such as 
(b) never makes an error in logic or arithmetic; and 

(c) inevitably chooses a portfolio which is efficient in terms 


analysis based on mean and variance; 


{probability .001 of winning $1000, 

Moraciliy ditipnont aeaene meee eae 
(probability .999 of winning nothing; 

probability .01 of winning $100, 


probability .99 of winning nothing. 


then we also know that his utility curve must be of a particular shaj 
‘on the other hand, a second individual acts according to (a) and (b) 
but instead of (c) he 


probability distribution 2f 


f the individual selects one of these distributions, a wheel is spun and 
Outcome is announced. The individual makes no relevant decisions 
een the choice of a probability distribution and the announcement of 
Wulcome. The absence of intervening decisions characterizes this as 
‘ple time period choice situation. The present chapter is concerned 
ih such single stage situations. The problem of choice over time is 
ii with in the next chapter. 
‘We shall speak of “outcomes,” “probability distributions of outcomes,” 
ferences among probability distributions of outcomes,” and “the 
feted utility maxim." This section defines these concepts for the 
le lime period discussion. 
ly definition, one and only one outcome can result. If it is possible for 
Dee to win, for example. both $1000 and a car, then this com- 
is defined asa single outcome, If it is possible for the individual 
her win nor lose, this is defined as another outcome. We can 
ine A situation whose outcomes, thus defined, are 


(c’) inevitably chooses a portfolio which is efficient in terms of mé 
probability of loss, 


then we know that the second individual has a utility curve of 
shape. An inspection of the utility curve associated with variance: 
it to be reasonable over a wide range of returns, An inspection 
utility curve associated with the “probability of loss” shows us 
certain portfolio situations the measure would have quite ui 
implications. 

Thus the inspection of the utility curve associated with a measure) 
can direct us to peculiarities in its evaluation of portfolios. In 
problems as well, such as the extent to which non-portfolio fa 
be reflected in a portfolio analysis, the expected utility maxim 
framework of analysis. 

‘When we argue the pros and cons of the expected utility maxim, 2” 
not directly arguing the pros and cons of particular portfolio # 

Rather we are arguing about the rules to be used in evaluating all BY win a car: 
types of analyses. In so faras it is applicable, the expected utility i) win $1000; 
provides a new viewpoint on the choice of criteria for the Wii) win both a car and $1000; 


portfolios. (iy) not win anything. 
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It will be convenient to assume that there are only a finite number 
‘outcomes. This is not a serious practical limitation on the analysis, 
ncan equal a million, a billion, or more, We may represent a prol 
distribution among the n possible outcomes by a vector 


be interpreted as either a chance of P or Q, or as a new vector of 
bilities obtained from vectors P and Q by the ordinary rules of 
Igebra 

shall consider a “rational man’ who has preferences among, 
bility distributions. If P and @ represent any two probability 
fibutions the rational man cither prefers P to Q, or prefers Q to P, or 
Jers both equally good. 

‘A wei of preferences may or may not be in accord with the expected 
iy maxim. If preferences are in accord with the expected utility 
im, there are numbers 


P= (Py Pw’? Pw’ Pas 


where p; is the probability that the ith outcome will be the one to 
Since one and only one outcome is possible, 


Ya=l- 


We can think of two probability distributions, 


Hyy May Hay * yy 
that the probability distribution 


(Pus +s Pad 
red to the probability distribution 


P=(py** spn) and Q=(qy°* 19) 


as “lottery tickets” offering different probabilities of outcomes. 
imagine flipping a coin, then engaging in lottery P if heads apj 
lottery Q if tails appears. If this is done, the probability of obtai 
ith outcome is 


Quy" 590) 
Lupe 
Dude 


Weighted sum Su,p, describes the individual's preferences among 


ibility distributions. Preferences which cannot be thus described 
Hat in accord with the expected utility maxim. 


(the probability of engaging in lottery P) times 
(the probability of obtaining outcome i if P is engaged in) ph 
(the probability of engaging in lottery Q) times 

(the probability of obtaining outcome i if lottery Q is enga 


This equals 
(1/2)p; + 1294 


In general, if there is a probability a of engaging in P and a 
(1 =a) of engaging in Q, then the probability of obtaining the ith 
is 


Suppose that a set of preferences can be described by the weighted sum 


U= 3p + 4p, + Ss. 
very same preferences can also be described by the weighted sum 


op. + (I = ae 


The probability distribution of ultimate outcomes associated 
probability a of P and (1 — a) of Q is therefore the probability dis 
indicated by the vector 


lapy + (1 — a)gy, apy + (1 = a)qay* > *s aPn + (l — a)Pale 


U’ = 6p, + 82 + l6py 


feason is that U' = 2U: 


3p, + 4pe + 8pq_is greater than 39, + 4g, + 89, 
Pd only if 

6p, + 8p + 16p5 is greater than 69, + 8q; + 1643. 
fe)iees which may be described by 


But this is exactly the vector 
aP + (1 — a)Q, 


using scalar multiplication and vector addition as described in 
VIII? Thus 
“aP + (1 — a)Q” 


Pt 187-158. 
ear; U= 3p, + 4p + Bp 
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can also be described by 
UT = 5p + 62 + 10py. 
Since, with p, + Pe + ps = 1, U" =U +25 
3p, + Apa + SPs = 3m + 492 + Bt 
if and only if 
Spy + Ops + WOps > Sqr + 692 + 109s. 

In general, if utilities 1, up, uy,* * *; Hq describe preferences, so do Wl 

uj=atby, 

uh =a + buy 

uy =a + buy, 


ul=a+ bu, 


where a is any number and b is any number greater than zero, 
and w’ thus related, 


Yup, > Dug, and only if Luip, = Duige 


an u! = Dwi = Leda t+ bu) 
=aypit 6S pias 
=a+dU, 
But, since b> 0, : 
Uj > Uj. if and only if, correspondingly, 
UY, 2 Uy) 


Returning to the example with w = 3, u, = 4, and uy = 
that it were desirable to describe the same preferences in such @ 
the utility attached to the first outcome equaled zero, w! that 
to the third outcome equaled 1. If we first subtract 3 from ei 
and then multiply each utility by .2 we obtain the utilities uy = 0, 
and uj = 1, The u’ describe the same set of preferences as the u 
the desired utilities equal to zero and one. 

In general, any outcome can be assigned utility equal to zero, 
outcome can be assigned utility equal to one. Ona utility Curves 
that in Figure 1 for example, the level of return with zero utility 
(higher) level of return with unit utility can be chosen arbitra 
zero and unit of the utility scale, therefore, is a matter of convey 
convenience, 4 
It is shown in the footnote that, once the zero and unit of 
scale are chosen, the rest of the utility curve follows from the i 


JARACTERISTICS OF ExPecTED UTiLITy MAXIMIZATION 


This sec 
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erences. No more points may be arbitrarily selected without affecting 


preferences represented by the curve. 


mn considers attitudes toward money risks by an individual 
maximizes expected utility. To postpone problems of utility maxi- 
ition over time, we consider only chance situations whose outcomes 
ietermined “immediately” after the choice of probabilities is made. 


We consider situations which, except for this matter of timing, are like 


froblem of buying or not buying insurance: a choice must be made 
her to take 


{)) a small loss with certainty, or 
fia 


rge chance of no loss versus a small chance of large loss. 


ore that outcome 1 assigned zr wily; outcome 2s assigned tit equal 
shall consider the utilities associated with: a third outcome preferred to the 
Wl not to the second; a fourth outcome preferred to both the first and second; 
4 fifth outcome preferred to neither the frst nor the second, 

uleome (2) is preferred to outcome (3), outcome (3) is preferred to outcome (1), 
ifthe individual acts according to the expected utility maxim, then there is a proba- 
fo wch that “a probability p of getting outcome {1) and a probability (I — p) 
Ning outcome (2)" is exactly as good as getting “outcome (3) with certainty.” 


My = ply + (1 ~ plug = pO) + (1 — pl) = 1p. 


uy = qua t (1 — 9); 
T= 9%, 
yams, 
2a) 


f outcome (1) is preferred to outcome (5), then there is a probability x such 
“+ (wobability r of (2) versus (1 — r) of (5)" is exactly as good as “certainty of (1).” 


ty =r + ts; 
O=r tll —rluy 


Ws = 


| Wie the zero and unit are chosen, the utilities attached (0 other outcomes follow 
‘he individual's preferences, 
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We sce that, even if odds are such that on the average the “ins 
company” makes money, the individual who maximizes expected 
may buy the insurance rather than take the chance. We similarly 
situations which are like the problem of choosing a diversified 
diversified portfolio, or that of buying, or not buying, a lottery ticket, 
sec how the individual's propensity to assume such risks depends 
utility curve. 

A curve shaped like i!’ in Figure 2 is said to be concave; one 
like ss‘ is called convex. The property which identifies a concave 


Figure 2. Convex and concave utility curves. 


(or a concave segment of a curve) is thata straight line drawn bet 
points on the curve everywhere lies on or below the curve. If it 
lies below the curve everywhere, as it would in the case of il’, we 
more strongly that the curve is strictly concave. With a convex 
straight line drawn between two points of the curve lies on or al 
If the line lies above the curve everywhere, it may also be refer 


strictly convex. 
Consider an individual who acts according to the strictly 
utility curve /’. Suppose that he has the choice of either 


(i) a 50-50 chance of a gain of $(a) or loss of S(a); or 
(ii) the certainty of no change. 


‘The expected income in both cases is zero. The expected utility 
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(by the convention used in drawing Figure 2). The utility attached 
(i) is: 


(1/2)U(+a) + (1/2)U(—a). 


Js equals the level of utility at the point b midway between the points on 
qurve corresponding toa and —a. Because of strict concavity, the 
iy at bis less than zero—the certainty is preferred to the uncertainty. 
‘The same sort of geometric argument will show that an individual with | 
‘Pineave utility function will prefer to pay a small premium rather than 
ya small chance of a large loss. This is even true, up to a point, when 
{pected money loss of the risk is less than the money amount paid as 
jum. In other words, the individual, depicted by utility curve 1’, 
\\i prefer to insure against large losses even if the insurance company 
ies some profit. 
Wippose that the random variables R and S are the returns on two 
vent portfolios. Suppose further that, although the probability 
Jibution of R and S may not be the same, the individual considers 
to be equally good. It is shown in the footnote that, if R and S are 
perfectly correlated and if the utility function is strictly concave, then 
‘Wited portfolio with returns T = (1/2)R + (1/2)S is preferred to R or S. 
Wi K and S are perfectly correlated, T can be no worse than R or S. 
| i an individual's utility curve is strictly concave, diversification 
‘seen two equally good portfolios cannot produce a worse portfolio 
erally will produce a better one,? 
‘A concave utility function, therefore, is a conservative one, consistent 
the purchase of insurance and the diversification of portfolios. 
ie implications of a convex utility curve are opposite those of the 
ve curve. The utility of the bet —a versus a, represented by c in 
2, is greater than that of no bet. An individual who maximizes the 
(ed value of the utility curve ss’ would not buy insurance, even if the 
ance company made no profits, He would prefer the probability 
{bullion of R or Sto that of the diversified portfolio 7. Action based 
} Convex utility curve is even more reckless than that based on the 
(itation of expected return—in contrast with the conservative 
lions of the concave curve. The expected return rule itself is the 
# (he maximization of expected utility when the utility curve is a straight 


PUT) wm EUL(1/2)R + (1/2)S] 2 (1/2)EU(R) + (1/2EU(S) = EU(R) = EUS). 
‘PtH strict concavity, 
ULL 2)R + (1/2)S] > (1/2)UCR) + (1/2)U(S) 


RES. 
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‘Vhree examples are given below in which it is possible for an individual 
prefer an alternative inconsistent with the expected utility maxim, In 
case the “wrong” alternative has a plausible appearance and is 
fn by many persons questioned. The first example is one of the 
\hor's quoted in {25}. The other two are due to Allais [29]. The 
lusion which Allais drew from his examples is that, since reasonable 
i choosing among simple alternatives contradict the expected utility 
Him, this rule must be a poor one. The conclusion which this writer 
was that the individuals choosing the “wrong” alternative acted 
Wionally. The argument to support this contention and similar 
ments! for the two examples of Allais are presented. 

ampte 1: The individual is given his choice of three lottery tickets. 
lottery A there is one chance in a thousand of winning $1000; in 
itty B there is one chance in one hundred of winning $100; in lottery C 
ff \s one chance in two thousand of winning $1000, and one chance in 
hundred of winning $100. The alternatives may be summarized as 
Ws 


A utility function need not be everywhere concave or everywhere co} 
The utility function in Figure 3, for example, is first concave, then 
then concave again. A curve of this sort can be consistent with insil 
against losses, taking small bets, and the diversification of the portf 


u 


Return 


| 
| 


Alternative Chances (out of 2000) Outcome 
h betting, and diversi & Z A) 

Figure 3. A util tent with insurance, betting, and diversi 
igure utility curve consistent with insu 3 ibis 3 
Utility functions with convex segments raise serious computing pI B 20 $100 
in portfolio selection. On the other hand, while concave utility fu 1980 0 
differ in their adaptability to computation, all are manageable c 1 $1000 
fashion. In the case of the allocation of “important money,” it * ea 
reasonable to use a strictly concave utility function. This is equival a “ 

1 


assuming that it is always at least as desirable to obtain a return 


ming. the : ‘ 
cattainty.as itis to take. a chance with expected outcome equsiilS cls asked to choose between A, B, and C frequently express a definite 


fence for C. This preference for C is contrary to the expected utility 


CONTRADICTIONS Alnce 


1989 


‘The writer feels that the most interesting and relevant arguments (ae 1 
* 5900 U(S1000) + 3 U(S100) + 355 USO) 


the expected utility maxim involve specific eases in which human § 
after careful deliberation, choose alternatives inconsistent with the 
The situations are reasonably simple, the human choice fairly defin 
contradiction between choice and maxim apparently inescapable, 
we must conclude that the expected utility maxim is not the ci 
rational behavior, or else we must conclude that the human being 
natural propensity toward irrationality, even in his most rel 
moments. 


+ See [23], [24]. 


a 999 17 10 99 

*5 los (81000) + v0] + sae U(S100) + Te vo 
| 1 

Ua t 5 Ue 

Us cannot be larger than both U, and Uy. 


| Voventially due to L. J. Savage. 
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found that his subjects preferred alternative A to B. He further 
wad them to consider the following: 
Situation C: receive $1,000,000 with probability .11. 

receive nothing with probability .89. 
Situation D: receive $5,000,000 with probability .10, 

receive nothing with probability  .90. 


Which is wrong, the expected utility maxim or the individu 
chooses C? 

Those who respond with a preference for C may be further interet 
whether they prefer A to B or vice versa. The argument is essential 
same whichever is chosen. For concreteness we shall assume that 
preferred to B. Remembering that the individual prefers A to B, we 
further inquire whether he would rather have A with certainty or a 
chance of obtaining A or B. In other words, shall we engage in loi 
directly or shall we first flip a coin to decide between A or B? 
unanimous response of those who prefer A to B is that they would 
to have A outright than a chance of A or B. 

The alternative of flipping a coin between A or B has certain probal 
of ultimately gaining $1000, $100, or nothing. The probability of wi 
$1000 equals the probability of obtaining lottery A times the prol 
of winning $1000 if A is obtained. Thus 


Eimeria A 
sion“ 22000 2000 


found that his subjects preferred D to C. If we assume that it is the 
hubilities of final outcomes and not the particular manner in which 
fire generated that count, then to choose A when A and B are the 
Wwatives and to choose D when C and D are the alternatives contra- 
the expected utility maxim. This may be seen as follows: letting 
l, U(1), U(S) represent the utilities of receiving nothing, $1 million, 
$5 million, respectively, we find that the preference of A over B implies 


U(1) > .1U(5) + .89U(1) + .01U(0). 


iy 
Similarly, 8 


[89U(0) — .89U(1)] 
With sides of the above inequality, we find that 


AILU(1) + 89U(0) > .1U(5) + .9U00). 


And, finally, 
Poo = | — Psio00 — Psio0 

_ 1989 

=o 
Note that the probability distribution of outcomes from flipping the 
to choose A or B is exactly the same as the probabilities offered 
Thus, if we assume that it is the probabilities of outcomes which 
rather than the particular way in which they are generated, we may 
the individual's preferences as follows: 

Cis preferred to A; A is preferred to a 50-50 chance of A or By 
50-50 chance of A or Bis exactly the same thingasC. Thus Cis defi 
preferred to A and A is definitely preferred to C—an inconsistency, 

EXAMPLE 2: Alllais asked subjects to consider the following 
alternatives:! 

Alternative A: receive $1,000,000 with certainty, 

Alternative B: — receive $5,000,000 with probability .1, 
receive $1,000,000 with probability .89, 
receive nothing with probability 01. 


we, if an individual who follows the expected utility maxim prefers A 
Wi he must prefer C to D. Ailais concludes that, since reasonable men 
wed A to B and D to C, the expected utility maxim must be 
sonable, 

1 punnot further interrogate Allais’ subjects, but I have found that my 
WWapues and acquaintances give similar responses. 1 have further 
Jioned them as follows: which would you choose 


{9} $1,000,000 with certainty, or 
{1} 5,000,000 with probability 10/11 and nothing with probability 1/11? 


ier response in itself is consistent with the expected utility rule. In fact, 
ver, everyone indicated a preference for the $1,000,000 with certainty. 
i consider the following situation. 

A wheel with numbers I through 100 (each equally probable) is spun. | 
iy number from 1 through 89 appears, you receive $1,000,000. If, 
ad one of the eleven numbers, 90 through 100 appears, then you have 
Felwice: you may either 


18) have $1,000,000 with certainty, or 
(1) have a 10/11 chance of $5,000,000 and a 1/11 chance of nothing. 


1 Thave taken liberties in translating fram francs to dollars. 
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This situation is never to be repeated. The possibilities and alter 
may be represented schematically as follows: 


Hut now let us calculate some probabilities. Suppose that in the first 
ion the individual decides to take (a) if the choice arises. Then he is 
in of winning $1,000,000. Thus the person who commits himself to 
| the first situation faces the same probability distribution of final 
lieomes as did the person who chose alternative A in the Allais example. 
ppose that the individual instead commits himself to (b) in the first 
Wation. Then he has a .89 probability of winning $1,000,000; a 
11\(10/11) = .1 probability of winning $5,000,000; anda (.11)(1/11) = .01 
ability of winning nothing, Thus the individual who commits him- 
fo (b) in the first situation faces the same probability distribution of 
| outcomes as did the person who chose alternative B in the Allais 
Mple, Suppose that the individual commits himself to (a) in the second 
Wuition. ‘Then he has a .89 probability of winning nothing and a .11 
ibility of winning $1,000,000. His probability distribution of final 
womes is the same as that of alternative Cin the Allais example. And 

ily, as the reader may confirm, if the individual commits himself to (b) 
Mwation 2 his probabilities of final outcomes are the same as if he chose 
Wnative D in the Allais example. 
Thus, if the individual prefers to take the million with certainty when the 
tunity is presented—whatever the non-occurring alternative—then 
‘Pelers the probability distrib of alternative A to that of alternative 
‘nd prefers the probability distribution of alternative C to that of 
native D. The person who preferred D to C in the Allais example 
fefers the probability distribution of C to D, when this is presented 
‘rms of a random process in which the meaning of the difference 
jeen a .11 and a 10 probability is made sharper. 


189 probabil 


{$1,000,000 with certanrty 
(2) $1,000,000 with certainty 


Choice 


Suppose that one of the numbers 90 through 100 appears on the 
which would you choose, the million with certainty or the chance? 
response was invariably “'the million with certainty.” Suppose 
to commit yourself in advance. Suppose that, before the first wI 
spun, you had to say “Mf will take (a) if the choice arises” or “I will 
if the choice arises.” To which would you commit yourself? 
unanimous preference was to commit oneself (‘*of course”) to (a).! 
Consider another situation: a wheel marked with the equally 
numbers 1 through 100 is spun. If one of the numbers I thro} 
appears on this spin, you receive nothing. If, instead, one of the 
numbers 90 through 100 appears, then you have the choice: 


(a) $1,000,000 with certainty, or 
(b) a 10/11 chance of $5,000,000 and a 1/11 chance of nothing, 


Schematically: 


{89 probabiity ‘$00 with certainty 


sampe 3: In the second example of Allais we again have two situa- 
i, each with two alternatives, In the first situation we may choose 


(a) $1,000,000 with certainty fen 


Choice or 


Alternative A: certainty of $1,000,000. 
rd 10/11 probabilty 


: Alternative B: a .98 probability of $5,000,000, and 
prota; —= 00. a .02 probability of obtaining nothing. 


As before, the situation is never to be repeated. Suppose that one 
numbers 90 through 100 appears on the spin; which would you 
the million with certainty or the chance? Again the response is una 
the million with certainty. Suppose that beforehand you had to 
yourself to (a) or (b). Again (a) is chosen. 


the second situation we may choose between: 
Alternative C: a probability of .01 of getting $1,000,000, and 
a probability of .99 of getting one cent. 
Alternative D: a probability of .0098 of getting $5,000,000, 
a probability of .0002 of getting nothing, and 
a probability of .99 of getting one cent. 


} This is a single stage choice situation, Once the individual commits 
(@) or (b), no decisions are made before the final outcome is announced. 
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Allais found that reasonable men prefer A to Band DtoC, I leave 
an exercise for the reader to prove, as was done in Example 2, that 


choices are in fact inconsistent with the expected utility maxim. A basic assumption of the expected utility maxim is that, if probability 
also confirm that the probabilities associated with C and D are the ANibution (1) is considered as good as probability distribution (2), and 
as those associated with the two possibilities in the following sit is considered as good as probability distribution (3), then (1) will be 
A wheel with 100 numbers is spun. If any number between 1 a Nidered as good as (3). This may not hold true in fact. By analogy 
appears, the individual receives one cent. If 100 appears, the indi Wider the perception of loudness: sound (1) may seem exactly as loud 
has the choice of accepting one million with certainty or taking a ¢) Sound (2), and (2) exactly as loud as sound (3), yet, when sounds (1) 
with (3) are compared, the first may seem definitely louder than the second. 
it/) surely have as much fuzziness in their perception of preferences as 
y have in their perception of sounds. When we assume that a rational 
\ exactly follows the expected utility maxim, we rule out any fuzziness 
Nis perception of preferences, 

The last section considered cases in which human choice was not 
istent with the expected utility maxim. We argued that, in these 
W\icular cases, the person was in error. Other cases can be cited, 
wever, in which human choice apparently contradicts the expected 
illy maxim but the person is not “in error.” These include cases in 
Nich the pleasures or displeasures of suspense, surprise, and the fun of 
ying the game are an important part of the individual's motivation. 
‘expected utility maxim can be extended to include such considerations; 
(0 large number of such “extensions” transform the maxim from a 
Wenient rule to a useless formality. 

When we assume that utility is attached to money outcomes, we are 
\ming that any way of generating probabilities is as good as any other | 
iy of generating the same probabilities, In fact, however, the “fun of 
bling” may be greater if probabilities are generated by the spin of 
els than by the use of a random number table. Even if there are no 
‘Wit, cards, wheels, or companions to liven the game, suspense may be 
‘Wifeascd by employing a sequence of drawings to determine an outcome 
{het of seven rather than heads or tails). 
‘The assumption that, if outcome A is better than outcome B, then it is 
‘i better than having a 50-50 chance of A or B is not always true of 
‘Huinan preferences. 1 may prefer to receive socks for my birthday rather 
‘an a tie, yet [ may insist on not revealing my preferences so that I may be 
“Wiprised” when my birthday arrives. I might be equally delighted to 
$ave apple pie or pecan pie for supper, yet I may feel it worth the effort to 
jell and find out which is in the oven, 

The expected utility rule can be extended to incorporate considerations 
‘Ju)) Ws surprise, curiosity, suspense, and the fun of playing the game. We 
‘ld attach a different utility to “watching wheels go round and then win- 
‘Wg 4 jackpot of $20” than is attached to “finding $20 on the sidewalk. 


HOEPTIONS, ASSUMPTIONS, AND EXTENSIONS 


a probability of .98 of $5,000,000 


and 


a probability of .02 of nothing. 


Surely the individual who originally preferred alternative A to altel 
B will again choose the $1,000,000 with certainty if the occasion 
But, again, as the reader may confirm, this is the same as preferrit 
probability distribution of C to that of D, contrary to the pi 
expression of preference. 

What may we say concerning the individual who in Example | pt 
alternative C to A or B, yet preferred A to a 50-50 chance of A 
What can we say about the individual who, in either Example 2 
preferred A to B and D to C, yet chose the million dollars with 
whatever the contingency which was not realized? One of two cot 
can be made, depending on whether the individual accepts or tej 
statement that choice should depend on the probabilities and not o} 
they are generated. If he rejects the statemen 
the probabilities, not on how they are generated,” he must feel that 
either more or less worth while if numbers are drawn from a single 
and a million dollars received than if numbers are drawn from two 
and a million dollars received. If this is the case, then there is no 
between the expected utility rule and the person's choice. Utility 
attached not to the one million dollars but to the enjoyment of a 
kind of lottery and the receipt of the sum. 

If, on the other hand, the individual accepts the statement that 
Practical purposes it is the probabilities of the outcomes rather # 
way they are generated that count, then our conclusion must be # 
individual has simply made a mistake in one of his choices. WI 
probabilities are presented in one manner, he prefers one distributic 
second; when they are presented in a different manner, he prel 
second to the first, His preferences are not only inconsistent wil 
expected utility maxim, they are also inconsistent with themselves, 
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We could attach a different utility to asking for socks and getting 
than is attached 10 wondering whether socks or ties are forthe 
and being pleasantly surprised to find the fatter. We could a 
different utility to “anticipating the pleasures of apple pie and 
presented with pecan” than is attached to “anticipating the pleas 
pecan pie and being presented with pecan.” 

By thus elaborating the set of outcomes we can remove the diffe 
between human preferences and the expected utility maxim. 

The fun of playing the game is no doubt important in actual gal 
situations. It may also be important for persons who take pleai 
following their security investments, especially when these rep 
small fraction of their earning power. We shall assume, howevel 
the “fun of the game” can be ignored in deciding on a rationale 
selection of a portfolio, especially when this involves the alloc 
large amounts of other people's money. 


| would prefer to go to the mountains next Saturday than stay at home; 

| would prefer to go to the seashore next Saturday than stay at home; 

Yet I would prefer to stay at home than to be uncertain as to whether I 
poing to the mountains or the seashore. 


\ preferences illustrate the effect of intermediate decisions. It would | 
\npleasant to prepare for the mountains and find that one was going to 
weashore. It would be equally unpleasant to prepare for the seashore 
find that one was going to the mountains. To prepare for both 
iid be more trouble than the trip is worth. The effect of these inter- 
inte decisions is clear when this situation is contrasted with one in 
wh no intermediate decisions are required, or in which the same 
mediate actions must be taken in any case. For example, | 


T would prefer to go to beach A than stay at home; 
J would prefer to go to beach B than stay at home. 


same preparations must be made in either case, Surely 


Some “contradictions” between human preference and the ¢) 
utility maxim are not contradictions at all, but misapplications 
interpretations of the maxim. It is important to avoid such err 
in deciding on the reasonableness of the rule and in applying it if 
A common error is the application of a single time period analy: 
essentially multi-time period situation. Another common error 
misinterpretation of the notion aP + (1 — a)Q 

Suppose that an individual is indifferent between 


J would prefer to be uncertain as to whether I was going to beach A or 
than to be sure that I was going to stay home. 


A second type of error is the misinterpretation of the idea of “mixing 
abilities.” Suppose that security (1) has a probability distribution of 
ms P; and security (2) has a probability distribution of returns Q. 
wwe further that P is considered to be exactly as good as Q. The 
ed utility maxim asserts that having either of these outright is 
‘ly as good as flipping a coin to choose between them. The proba- 
ily distribution in the latter case is the “mixture” (1/2)P + (1/2)0. 
hytes sometimes conclude that the statement “(1/2)P + (1/2)Q is 
ily as good as P or Q™ implies the statement “diversification does no 


(i) receiving an extra $10,000 one year from now, or 
(ji) receiving an extra $10,500 two years from now. 


He would not, as a rule, be indifferent between 


This is a non sequitur. The probability distribution associated with 
ing a coin to choose between two securities is not the same as the 
bility distribution which results from investing equally in each. 
», AS an extreme example, that there were one hundred securities 
with the same probability distribution. Suppose further that the 
iis on these securities were uncorrelated. Diversification among the 
hundred securities would considerably reduce variability of return. 
sing a single stock at random, on the other hand, would result in 
ly the same probability distribution of returns as investing in any of 
Outright. Clearly, the mixing of probabilities and the mixing of 
folios are not to be confused. 
Wi hias been argued that the assumption that “if P is exactly as good as Q | 
tither is exactly as good as aP + (1 — a)Q” should be doubted or | 


(a) knowing that (i) will happen, 

(b) knowing that (ii) will happen, and 

(c) knowing only that there is a $0-S0 chance of (i) or (ii), 
outcome to be revealed a year from now 


Situations (a) and (b) are clearly better than (c). Actions can 
between now and a year from now. if one or the other outcome is) 
with certainty, that cannot be taken in the state of uncertainty. | 
| not a contradiction of the expected utility rule but an example of 
time period situation. The application of the expected utility rule 
situations is discussed in the next chapter. 

At first sight the following seems like a contradiction of the 
utility maxim: 
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Part (b) of Axiom I says that we should be rather disappointed wi 
perfect machine if it told us that portfolio 1 is better than 2, portfol 
better than portfolio 3, and at the same time—with no change of in! principle. aP + (1 —a)Ris the ov 
tion or circumstances—it told us that portfolio 3 is better than port elf to choose P if the opportunity arises; 2Q + (1 — a)Ris the over- 
Part (b) also rules out fuzziness in the perception of preferences, jrobability if one commits oneself to choose Q if the opportunity arises. 
implies that, if P is exactly as good as Q, and Q is exactly as good ‘wf assume that different ways of generating the same probability distri- 
then P must be exactly as good as R. jonns are equally good, the statement that P should be chosen instead of 

Axiom I says that preferences define an ordering of probability | whatever a or R, is the same as the statement that aP + (I — a)R 
butions (just as age defines an ordering of people, hardness defi Id be preferred to aQ + (1 — a)R. 
ordering of objects, and luminosity defines an ordering of stars), Another way of looking at Axiom I! is the following: In alternative 1a 
various properties of orderings can be deduced from Axiom I. We iis flipped: if it comes out heads the lottery Pis chosen; if it comes out 
not assume separately, for example, that, if P is considered bet then lottery Ris chosen. In alternative 2 a coin is flipped: if it comes 
Q, and Q is considered exactly as good as R, then P is considered heads lottery Q is chosen; if it comes out tails lottery R is chosen. 
than R. This follows from Axiom | without additional assumptiol tabular form: 

The next principle concerns situations similar to those in which 
is flipped to determine in which of two lotteries to engage. The 
is expressed in 


@ whatever the alternative that did not occur and whatever the proba- 
ly with which it did not occur. Axiom Il is simply a formalization of 


ifheads if tails 


Alternative | 
Axiom Il: If probability distribution P is preferred to probability dist 


Q, and if R is any probability distribution at all, then Alternative 2 


a probability @ of obtaining P and (1 — a) of obtaining R is preferred 
a probability a of obtaining Q and (1 — a) of obtaining R—as long 

not zero. 

In other words, if P is preferred to Q, if a is greater than zero, and if R 

distribution whatever, then 


aP+(l—a)R is preferred to oQ + (1 — o)R. 


Mative | is surely superior to alternative 2. For, if a head appears, 
(native 1 results in the choice of a preterred distribution; whereas, if 
Appears, they both result in the choice of the same distribution. The 
iment applies even if the probability of a head is a rather than 5. But 
Jrobability distribution associated with alternative | is aP + (1 — a)R, 
ie that associated with alternative 2isaQ + (1 —a)R. Thus Axiom Il 
fothing more than a formalization of the statement that, if P is better 
i) Q, aiternative 1 is better than alternative 2. 

1) Axiom If the probability distribution R can be any probability 


Axiom I requires a word of explanation. Previously we co 
situations in which there was a probability (1 — a) of receiving prol 
distribution R and a probability @ of having a choice of P ve 
Schematically: 


y* Wibution. In particular, we can have R = P, In this case the axiom 

that, if P is preferred to Q, then P is preferred to aP + (1 —a)Q. 

cy Fe wecan let R = Q,in which case the axiom says that, if P is preferred to 
g ) then aP + (1 — a)Q is preferred to Q. 


Jet us consider a situation with three possible outcomes whose proba- 
Willies are py. pas py. Ian individual's preferences are in accord with the 
‘preted utility maxim, they may be described by an ordering function such 
” 


It seems reasonable that, if P is preferred to Q, then it should be 


* Strange to say, even the assumption that “if P is better than Q, and Q is 
A, then P is better than R” has been objected to as a principle of rational 
‘The main argument is that human beings, and even rats, do not always 
according to this rule, Granting this immediately, 1 would still be rather she 
perfect machine did not follow it. 


U = 3p, — 2p + Bs. 
probability distribution is preferred to another if and only if it has a 


232 THE EXPECTED UTILITY MAXIM 233 


PORTFOLIO SELECTION 


higher U, as computed in equation (1). We can conceive of 
other ordering functions: 


minus this probability may be thought of as the chance that lightning 
sirike every telephone post in the country tomorrow, thus forcing 
| 1 & T. into bankruptcy. I am sure that most people who are in the 


Q) Um log, + 2 lop p, + 4108 Pn, for a cigarette would be willing to put up with this much tisk, 

3) U=Py" Pa’ Pw ‘The third of our axioms rules out discontinuities as illustrated by our 
(4) U = 5p, + 4paps, te example. We shall actually present three versions of a third 
6) Van + P+ (A wn: III, 11’, and III", It is unnecessary to assume all three. Axioms 


F Hi, and 111 have exactly the same implications as Axioms I, II, and IIT’. 
and so on, Ordering functions (1) through (5) are continuous; implications of Axioms I, II, II!” differ from the preceding in only one 
changes in the ps do not lead to jumps in U. respect. Axiom IITis the easiest to use. It is the version employed 
OF the five continuous functions listed above, only the first 1 in this chapter to show the relationship between the axioms and the 
Axiom II. In fact, of all possible continuous ordering functions t (ed utility maxim. Axiom III’ seems to me more fundamental, but 
‘ones which satisfy Axiom II are weighted sums. More difficult to work with than Axiom III, We state it here as an 
ative formulation; we show in Appendix C that it may be used in 
of Ill. Except for one small restriction, Axiom III” would be 
biguously superior to Axiom III, As with Axiom III’, we simply 
it here as an alternative formulation and discuss it technically in 
fdix C. Appendix C also considers other matters related to the 
iM systems, such as the possibility of an alternative formulation of 
mii, 
“141 us begin with 
‘Aviom III's If probability distribution P is preferred to distribution Q, and 


U] ee to probability distribution R, then there are numbers a < | and 
such that 


w= Sup) 


and disguised versions of linear functions; for example, 
V = Ut =(SUp). 


More precisely, if a set of preferences 


(a) can be described by any continuous ordering function, and 
(b) satisfies Axiom Il, 


then these preferences are in accord with the expected utility 
[Axiom Lis implicitly included in the assumption that an ordering 
exists. The proof that (a) and (b) imply the expected utility maxi | of + (1 — a)R is preferred to Q, 
corollary of the main result of the following sections.) d ) Q is preferred to bP + (1 — b)R. 

It is possible to construct preference patterns not describabl 
continuous ordering function. Suppose that an individual pre 
smoke a cigarette rather than not smoke one; and prefers not to 
than be struck dead instantly. He must choose between 


Wher words, if P is preferred to Q, and Q is preferred to R, there is 


1)» probability a—perhaps infinitesimally close, but not quite equal 

| such that this much chance of P instead of R is preferred to QO with 
ity; and 

1) & probability b—perhaps .00000000001 or smaller—such that this 

ehance of P is not as good as having Q with certainty. 


(i) smoking a cigarette with probability a and being struck dei 
probability (1 — a), or 
(ii) having neither happen with certainty. 


‘Aviom 111’ formalizes the argument against discontinuities presented in 
‘Siparette example. 

‘Ai axiom which is easier to use says that, between the value of a that 
aP + (1 —a)R preferable to Q, and the value of b that makes 
+ (= 6) less desirable than Q, there is a value of ¢ such that 
4 () ~ ©)Ris exactly as good as Q. The existence of c can be deduced 
Axioms I, II, and III’, It is convenient in the text, however, to 
-99999999999999999999999999999999999999999999999999: \ts existence from the outset. Thus in the discussion of the 


Suppose that the individual chooses (ii) for all values of a less than 
chooses (i) fora = 1. His preferences cannot be described by a con 
ordering function. 

But people do cross busy streets to buy a package of cigarettes, 
other ways affect the probability of death. When we say “any” 
bility other than **!"" we include probabilities such as 
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relationships between the axioms and the expected utility maxim we Milow. What is the relationship between these principles and the expected 
use Wility maxim? 
Axiom IIL: If P is preferred to Q, and Q is preferred to R, then there The relationship is as close as a relationship between a maxim and a set 
number c such that Wf principles can be. Ifa set of preferences is consistent with Axioms I, 
cP + (I — OR is exactly as good as Q. 1) and 111, then it is in accord with the expected utility maxim. Ifa set 
Wf preferences is in accord with the expected utility maaim, it is consistent 
Wilh the axioms, An individual acts according to the axioms if and only 
Wf he acts according to the maxim. If we understand the conditions and 


‘8 irements imposed by the axioms, derstand th tions 
(a) There is at Jeast one set of three outcomes such that outcome | is pi Bins hei orate utility. ahs co mad ied 


to outcome 2, and outcome 2 is preferred to outcome 3. Hae 
(b) In the case of every triplet of outcomes (i, j, and k) such that outcoi If we were to add other principles, not implied by those of the last 
preferred to outcome j, and outcome j is preferred to outcome k, there Welion, to our set of axioms we would further limit the sets of preferences 
number ¢ such that PHistent with our axiom system. If, for example, we assume that 
probability ¢ of obtaining outcome i plus ‘Pyersification between two equally good portfolios never produces a worse 
a probability (I — c) of obtaining outcome (k) PHifolio, then only concave utility curves would be consistent with our 
is exactly as good as (/) with certainty, Peloms, In any case, a set of preferences consistent with all three of the 
‘Prioms of the last section must be in accord with the expected utility 
maim 
The rest of this chapter is devoted to showing that the expected utility 
tim follows from Axioms 1, 1, and II], First we discuss notation and 
Heh discuss relationships useful in deriving the maxim. Finally we show 
Wl the maxim follows from the axioms through a series of intermediate 
outs 


Essentially the same implications which follow from Axioms I, II, al 
also follow if we assume Axioms 1, If, and III’, when Axiom Il} 
defined as: 


The statement that outcome iis preferred to outcome / is the same 
statement that the probability distribution which gives é with cert 
preferred to the probability distribution which gives j with cei 
Axiom III makes an assumption concerning any probability distri 
P preferred to Q preferred to R. Part (b) of III” requires only that 
assumption, the existence of c, holds true for probability distriby 
which give outcomes with certainty. Thus part (b) of III” assumes 
Jess than II]; yet under certain conditions it has exactly the same it 
tions. These conditions, as expressed in part (a) of III", are that at Noration 
one triplet of outcomes exists such that outcome | is better than outer 
and outcome 2 is better than outcome 3. 
We have wandered from the objective of setting down a set of prin 
and deducing their consequences. We are now in the realm of stal 
with one set of reasonable principles which give certain consequences 
then searching for slightly more reasonable, slightly simpler, slightly 
general principles from which the same conclusions can be di 
While such excursions in “comparative axiomatics” may be interes! 
they are somewhat removed from portfolio selection. The mathemati 
trained reader who wishes to pursue the subject is referred to Appendi 
For the rest of this chapter the fundamental principles used in anal 
rational behavior are those expressed in Axioms I, II, and II]. 


/, Q, R will represent, as above, vectors describing probability dis- 
Wibutions among n possible outcomes, Anything which is true in general 
Piwerning probability vectors is true of our P, Q, R. 

We shall let the symbols >, ~, > represent relationships among 
Preference 

P>Q means “P is preferred to Q.” 

?~Q means “P is exactly as good as 9.” 

P= Q means “P is at least as good as Q.” 


1) an axiom system some symbols are “‘undefined’’; others are defined in 
Shims of these undefined symbols. Generally there is a choice as to which 
foheepls are “primitive” and which “defined.” In the present case we 
fiwid take > and ~as undefined ideas and then define “P > Q" to mean 
Wither P > Q or else P~Q." Alternatively, we could take > as the 
Wulefined symbol and let: 


'P~Q” mean “P>Qand Q>P"; 
"P>Q” mean “P>Qand not Q>P.” 


THE Axioms AND ExpecteD Uritity 


Axioms I, II, and III of the last section express principles which 
would expect our Rational Man or our Perfect Computing Machine 
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The first approach seems more natural; the second, which minimii 
number of undefined terms, would probably be preferred by the cont 
of axiomatics. We shall adopt the second since it is slightly 
convenient. 

Our formal axiom system, including axioms and definitions, 
expressed in terms of the relationship “>" as follows: 
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40 on, until 


P=q,E, + (1 —9,)P", 


4e0<9,<1. Forexample, 
P = (1/8, 3/8, 2/8, 2/8) 


Wy be formed as follows: 
Pi = (1,0, 0,0), 
PY = (3/4)(0, 1, 0, 0) + (1/4){1, 0, 0,0) = (1/4, 3/4, 0, 0), 
Pm (1/3)(0, 0, 1, 0) + (2/3)(1/4, 3/4, 0, 0) = (2/12, 6/12, 4/12, 0), 
Pi m= (1/4)(0, 0, 0, 1) + (3/4)(2/12, 6/12, 4/12, 0) 

= (1/8, 3/8, 2/8, 2/8). 


‘Axiom Ia: If Pand Q are any m component probability vectors, cither I 
or Q =P, or both, 


Axiom Ib: If P > Q and Q > R, then P > R. 
Definitions: 
“P > Q” means “P > Qand not Q > P." 
“P~0Q" means “P>Qand Q>P.” 
Axiom Il: If P > Q, then 

aP +(l—a@R>aQ+(1—a)R forany Rand | 2a>0, poneral, if we tet 
Axiom I: If P > Q and Q > R, then there is a number @ such that Pek 

Q~aP + (1— aR. ; 


Ta, above, is equivalent'to a formulation used previously, namely, ie = Ea es Oe ge 
‘Axiom la’: Either P > Q or Q > Por P~O. | Tiago La ale Bl Mfg eR arg t 
We shall let ee Ata pe ] 
E,=(0,0,- > + 1,-- 0) Pat Pet Ps Pt Pat Ps 
be the vector with 1 in the ith place and 0 everywhere else. Fy RB a Pipe ga Ny 
Athth AthtA AthtAa 


probability vector which represents the certainty of receiving the 
come, (Since we will not need £ to represent “expected return!” Bl natty arrive at 


chapter, this use of F; should cause no confusion.) P= pEy t+ + pki 
rE ny. 


Whvequently we shall consider a relationship o(P) between numbers v 
Vectors P = (p,, Pp, Pa.‘ * “+ Px) We will know some properties of 
‘lationship but will not have an exact formula for it. The following 
be valuable in helping us further characterize «(P): 

i for every pair of probability distributions P and Q, we have 


vlaP + (1 — a)Q] = a: oP) + (1 — a) (Q), | 
forO<a<l, | 


Preciminary Resucts 


Any vector 


P=(Py"*sPn) 
can be represented as an average of the vectors E, = (0,- * 5 
++ +,0). Specifically, 
(Piet Pa) = Prey + PaEa ++ «+ Pan: 
More important for our purposes is the fact that any P can be ol WP) may be written 
letting 


oP) = ¥ pat 


| is, if the v associated with aP + (1 — @)Q is always a times o(P) 
{1 = a) times 1(Q), then » may be expressed as a weighted sum of 


Ur 


Pi= By 
P? = E+ (1 —9)P?, 
P? = gE + (1 — 9))P%, 
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i) the preferences consistent with Axioms I, II, and III can be described 
weighted sum. If preferences are consistent with the axioms, they 
jn accord with the expected utility maxim, 

‘The converse result may be readily confirmed by the reader. 
Proor: 

1 A number of properties of ~, >, >, can be derived from Axiom I. 
include: 


i) P> Qand Q> Rimply P> Rk, 
) P ~Q and Q ~ Rimply P~R, 
P > Qand Q ~ Rimply P> R, 
) P~Qand Q> Rimply P> R. 


‘shall prove (a) by way of illustration. The reader should prove one 
an exercise. 

Woce P > Q, we have, by definition, P> Q but not Q@>P. Since 
Ry we have Q > Rbut not R>O. P>Q,Q>R, and Axiom! 
P>R. Suppose R>P; then P>Q would imply R>Q, 
ary to assumption, Therefore we have P > R and not R>P; 
i, P>R 

| We now show that Axioms I and II imply that, if P ~ Q, then 


[oP + (1 — pO] ~P. 
we, to the contrary, that 
[pP + (1 —p)Q] > P~Q, 


THE EXPECTED UTILITY MAXIM 
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PROOF: Define u; to be o(E). We saw previously that we may 
Pas 


P= p,Ey + (1 — pyP*, 


where 
Prt me (If — pal [Pa-tEn-i + (1 = par, ete: 


Repeatedly applying 
w(aP + [1 — a]Q) = a(P) + (1 — aXQ), 


we find that 
U(P) = upp + UL = pad CP!) 


= gpa + ManaPr-a + UL = Pa = Pantd HPPA) 


= Mapa t+ NPL 


Tue Derivation 


We now show that, if preferences are consistent with Axioms I, 
IH, they are in accord with the expected utility maxim. The 
proceeds through a series of intermediate results: 


1, First we note properties of orderings implicit in Axiom I. 
2, Next we show that Axioms I and I! imply that if P~ Q and 
number between 0 and 1, 


pP+(1—p)Q~P~9. 


3. Next, if P ~ Q, then 
pP+(1—p)R~pQ + (1 — p)R. 


4, IfP > Rand P > Q > R, then there is one and only one pr 
p such that 


by Axiom II, 
[Ph + (1 — pO) > (12)[pP + (1 — p)Q) + (11290 > O~P. 


O~pP + (1 — pyr. (DpP + C = pO) + 12)O = (A/2»p1P + [1 — (/2)p10. 


5. There are £’ and £° such that 
Boe Pi Ee 


((2)plP + [1 — (1/2)p]@ > P, 

implies, again by II, that 

Uy) + (1-57) 0] > ae [lee) + (1-42) o] 
+(- [alle 


‘Fapression on the right, however, is simply 
PP + (I — p)Q. 


That is, the most preferred and the least preferred probability dist 
involve E,, which give outcomes with certainty 
6. For every P there is one and only one probability u(P) such 


p~vE’ + (1— 0E*. 


7. vlpP + (1 = p)Q) = p: P) + (I = p) (0). 
The last result implies that 
AP) = Lupe 


THE EXPECTED UTILITY MAXIM 
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prove this in three parts: 


(i) i P~ Q, then II implies that only p=1 will make Q ~ pP + 
= pr, 

{) if R~Q. then IL implies that only p =0 will make Q ~pP + 
= pr, 

We) iP > Q > R, then IIL implies that there is some p such that 


pP+(l—p)R~Q. 


I} implies that for any p’ > p, p'P + (1 — p')R> Q, and for p" <p, 
WH true that 


Thus we have 
pP + (1—p)Q > [(l/2)lP + [1 — (1/2y]0 > pP + (1 =) 


a contradiction. 
The reader may show that a similar argument applies if we assum 


(pP + (1 — p)Qh <P. 
3. Next we show that, if P ~ Q, then 
pP+(1—p)R~pQ + (lpr 
Suppose, on the contrary, that in some instance 
pP+(1—p)R>pQ+(1—p)R. 
Because of (2) we know that we cannol have R~P. For then 


O>p'P+(l—pDR. 


4 Let £' be at least as good as any other certain outcome; let £° be 
(ter than any other certain outcome, Thus 


Q~P~pP + (1 —p)R~pO + (1—p)R~R EDE,>E 
Eee ascume Ul ily outcome £;, We shall show that, for any probability distri- 
Ree lion P, 
Essentially the same argument applies if P> R. Since ES PS EY 


R> pP+(1—p)R> pQ+(1—p)R>P~O, 
according to Axiom III there is a probability g, such that 
gp + (1 — p)R) + (1 —9)R~pP + (1 — p)Rs 


have already scen that P may be developed as follows: 
Pi=E,, 
P? = gE, + (1—4,)P!, ete, 


that is, 


pgd + (1 — pg)R ~pP + (1 — p)R. 
Let us use the g defined above to define a new vector (1 —q) 


P=q,£, +(1—g,)P™". 


BSP 


We have 
(L-g)R+qgQ>P~Q. ‘ 
Hence, by Axiom II EDP. 
Pll —g)R +90) + (1—p)R> pP + (1 — PR. E> EM, 
But ieee 
pl(l — 9)R + 9Q) + (1 ~ p)R = pg + (1 — pa)R 
Thus fore E’ > P, by induction. A similar argument shows that 


Pq@ + (1 — pq)R> pP +(1 — p)R, 
which contradicts our previous conclusion that 
pqQ + (1 — pg)R ~ pP + (1 — p)R. 
4. IfP > Rand P > Q > R, then there is one and only onep 
Q~pP + (1 — p)R. 


(EU Py 


4 Vor every P there is one and only one probability v such that 
P~oE' + (1 —vJE*. 


corollary of 4and 5. The number v as a function of P, WP), is 
‘Wering function in that (P) > o(Q) if and only ifP > Q. To show 
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that it is a linear ordering function, and that the individual's pret 
are therefore consistent with the expected utility maxim, we need onl 
7. w(pP + [1 — p]Q) = pe(P) + (1 — pe(Q). Let 


4 =P), 


= 1(Q). Cuarrer XI 
Then 
P~ oe’ + (1—re% UTILITY ANALYSIS OVER TIME 
Q~ E+ (1 — EE 
It follows from 3 that 


pP + (1 —p)Q ~ pl,£’ + (1 — ovE% + 1 — pileE + (1 = SENTATION 
But the expression to the right of the ~ is 


Now turn to the problem of rational behavior over time, still assuming 
[pe + (1 = plo’ + [= poy — (1 — prod ®. 


Objective probability distributions are known. The problem of 
Therefore WVior when probability distributions are not known is discussed in 
vfpP + (1 — p)Q) = pr, + (1 — pi 

= puP) + (1 — pg). 


Hence we may write U= xP), u = o(£,), and order the indi 
preferences by 


I Part 11 we used random wheels to represent chance situations. In the 
WM chapter it will be convenient to employ such wheels similarly. 


U= Spm /ppose that the individual must choose one and only one of the bands 
‘# wheel as in Figure 1. Associated with each band is a level of current 
imption (¢.g., consumption expenditures in constant dollars), Each 
on the band indicates the wheel to be spun next. Thus, if the 
ual chooses band 3, he receives C = 43 current consumption; and, 

wheel stops as in the figure, wheel number 14 is spun next. The 
y of “bands” and “places" may vary from wheel to wheel. There 
he wheels on which there is only one band, and therefore no choice; 
‘may be bands on which there is only one outcome. We assume that 

Wividual knows the characteristics of all wheel: 

Nv system of wheels can represent the probabilities associated with 
variety of situations. Consider first the case of an individual who 
his funds between consumption and securities. One band of our 

| would represent the decision to consume amount D and invest the 
ip portfolio A; another would represent the decision to consume 
it D and invest the rest in portfolio B; a third would represent the 
i to consume amount E and invest in A; and so on. The next 
| (o be spun (i.e., the next opportunities for consumption and invest- 
) would depend on the choice of band and the result of the spin. 
Wheels can also describe the probabilities associated with illiquid 
‘The bands on a first wheel might represent various combinations 
243 
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vidual dies at time 1° < T, every wheel after (® has only one band with 
possible outcome: stay dead. 
VWiychological considerations (such as the pleasures or displeasures of 
and playing the game for its own sake) can also 
san elaboration of the set of outcomes as 
ised in Chapter X, page 225. “Mood” is an important psychological 
Wwble in the present discussion. I may postpone decisions (such as 
iorrow's menu, tomorrow's entertainment, or tomorrow's total con- 
plion expenditure) because I cannot perfectly predict tomorrow's 
"or “tastes,” In keeping with this chapter's assumption of known 
bility distributions, we could assume mood to be a random variable 
|| known probabilities. These probabilities may depend on previous 
Wrences. The effects of mood, therefore, may be incorporated into 
Hulcome of our system of wheels. 


of consumption, security purchases, borrowing, purchases of real 
or other business decisions. The opportunities available the 
depend on the amounts of goods in process, obligations contracte 
of liquid assets, and so on. Different combinations would corres} 
different wheels. The wheels (opportunities) of the next period 
‘on the choice of band this period and the outcome of this period’ 


INITIONS 


Shall refer to “the world,” “strategies,” “outcomes,” “probability 
(butions of outcomes,” and the “expected utility maxim.” The present 
lion defines these terms for the current discussion, 

world may be thought of as either 


i) # set of wheels as described in the last section; or 
) the physical reality which they represent. 


Band Consumption or Bg nde 

A 5 “world,” in short, refers to the set of opportunities and possibilities 
Hime. 

2 eh Hivategy is a set of rules which prescribes action over time for all 

3 ils 43 ible contingencies. A strategy might, say, “Choose band 1 on the 


Wheel, If the second wheel is number 4, choose band 5; but, if the 
i) wheel is number 6, choose band 8... If in the first 19 periods 
nption has been C,,+ - -, Cyy and if wheel 32 represents the oppor- 
4 at time 20, then choose band 2. . .” 

irategy can depend on future mood. The strategy might specify, 
‘sample, that if at time 20 the individual is faced with wheel 32 
\eNling a certain financial position and a feeling of fatigue) he should 
band 2 (take a vacation). Butif at time 20 the wheel to be spun is 
{ 31 (same financial position but no fatigue) then choose band 1 
pone the vacation). 

wicome, in the present discussion, is a time pattern of consumption 
Cp). If the C, represent amounts of money devoted to 
mption, an outcome is a “history” of consumption expenditures. 
{, are vectors of goods and services enjoyed, then consumption is a 


Figure 1. Random future opportunities. 


In our general discussion the reader has the option of int 
either as a single number (c.g., consumption expenditures if 
dollars) or as a vector of goods and services enjoyed. In 
contexts C; is further specified. Consumption could also be 
variable (or random vector) that depends on where the wheel 
is a question of convenience of analysis, since any situation 
described with random C, can also be described by suitable 
non-random C,; and vice versa. 

We assume that each wheel has a finite number of band 
has a finite number of places; and there is a fixed finite nu 
spins to be made. 7 might represent a longest possible lifet 
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time series of such enjoyments. Since we assume that there is a fini 


ye max (1, Cy/C)) is either 1 or C,_4/Cy whichever is larger. In this 
ber of wheels and time periods, there is also a finite number of 


ple, if C rises through time, w is the (discounted) value of the loga- 
outcomes. We can therefore label possible outcomes with ni ws of consumption. But, if a C is lower than the preceding C, w 
Vedareri ceili nn ssi Is the extra disutility of reducing consumption compared to keeping 
The nature of the world and the choice of a strategy deter ‘Han intermediate level. We attach no special significance to the above 
probability distribution of time patterns (C), C,.* © *, Cp). A present it as illustrative of considerations which can be incorporated in 
probability distribution can be described in two different ways: wility function, 
WN ppose that the probability distribution of outcomes of some strategy 
{/\.° + *:Py) Suppose that a second strategy has a probability dis- 
,). We can imagine a new strategy which 
Ms with the flip of a coin. If the coin comes up heads, strategy 1 is 
wed thereafter. If the coin comes up tails, then strategy 2 is followed 
The probabilities of outcomes associated with this new 


(i) Each possible C, has a probability. Given that a particular 
occurred, there are conditional probabilities associated with each 
, Given that a particular C, and C, have occurred, there are coi 
probabilities associated with each possible Cy, etc. 

(ii) Each of the n possible time patterns (C,,- « , Cy) has a prot 
These probabilities may be denoted by py,* * *s Pis** °sPne 


re 
The probabilities of (i) are equivalent to those of (ii). Either set (U2)(Prs* > + Pa) + (U2NQis* * *s 4nd 
inferred from the other.’ Formulation (i) emphasizes a dyna 
folding view of the subject. Formulation (ii) presents the same 
tion in a manner more convenient for our purposes. We shall 
employ formulation (ii). 

We assume that our Rational Man chooses a strategy on the 
resulting probability distribution of outcomes. Thus the choice of 
is reduced to choice among alternative probability dist 
(Past + Pads 

The expected utility maxim asserts that a number u, should be 
to the jth possible outcome; probability distributions (p,,° > 
be ranked according to the linear ordering function: 


|ATIC TREATMENT OF MULTI-PERIOD CHOICES 


i) the last chapter we showed that certain axioms concerning probability 
Wibutions P, Q, R. and the preference relationships >, >, and ~ imply 
preferences are in accord with the expected utility maxim: if P, Q, R, 
and ~ are consistent with the axioms, P is preferred to Q if and only 


Lup, is greaterthan Yu, 


Ne symbols P, Q, R, >, >, ~ can still be used in the analysis of 
ior over time. P, Q, R still represent probability distributions, but 
probability distributions of time patterns of consumption. >,>,~ 


U = Lup; Will represent preferences among P, Q, R. The relationships of 

4 2 ms I, II, and Ill—as expressed formally on page 236, Chapter 
A utility function is any rule which associates a number u to each | in still be aséeried ing th tPOR. and ae 
time pattern (C,,- «+, Cp). For example, with T= 3 and ‘ed concerning the present P, Q, R, >, >, 


tie axioms about the probability distributions (P) of outcomes 
* +, Cp) are accepted, then the multi-period expected utility 
WM must also be accepted for this case. The proof at the end of the 
Mhapter, based on the formal properties of the axioms, applies equally 
é J) single period or multi-period analysis. 
Seance # Must consider, however, whether formal relationships expressed in 
Pr(Cha + +, C4) = Pr(CPYPHCH|CP) « « * PrCHICH > + ‘+*/oms appear as plausible when applied to probability distributions of 
Wes Spread over time as they do when applied to outcomes resulting 
‘ingle period choice situations. The present section considers the 
‘bility of the axioms in their new application. I believe the reader 
found the axioms convincing in the last chapter will find them equally 
wing now. 


amount of money devoted to consumption in period ¢, we might 


u = log C, + .9 log Cy + .81 log Cy — 5 log [max (1, Cy/Cy)) 
— .45 tog [max (1, 


and conversely 


yD mach 
PACICE: - -, Cha) = Ae 


CE Cra 
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The axioms can be interpreted either in terms of preferences 
probability distributions P or else in terms of preferences among the 
egies which give rise to these probability distributions. The 
approach emphasizes the formal identity between the present axiot 
those of the last chapter. The second approach, in terms of st 
parallels an argument to be used in Chapter XII concerning prot 
beliefs. 

The formal statement of the first axiom is 


In the above example the Rational Man had a choice of two situations 
h can be represented schematically : 


probability 62 strategy 1 
beeen a a a 
Probabiity 33 strategy 3 
probability 62 strategy 2 
Situation 2 ocean fH 
Probabiiny 33 strategy 3 
Fis the probability distribution of outcomes associated with strategy 1, 
the probability distribution associated with strategy 2, and R the 


ability distribution associated with strategy 3, then the probability 
\bution associated with situation | is 


Axiom Ia: For any two probability distributions P and Q, 
P20 or Q2P (or both). 
Axiom Ib: If P > Q and Q > R, then P > R. 


The first part of the axiom says that the Rational Man, or Perfect’ 
puting Machine, can make up its mind about a course of action. 
tell whether he prefers the strategy which gives rise to probability 


bution P, or prefers the strategy which gives rise to the pr 62P + 38R, 
distribution Q, or considers choice among these two courses of Peek ow De ; Pasenss 
matter of indifference. The second part of the axiom says thal Wp the probability distribution associated with situation 2 is 
prefers a first course of action to a second, and a second to a third, 620 + 38R. 


will a fortiori prefer the first to the third. This ordering axiom 
me to be as compelling when applied to strategies and the prol 
distributions associated with them as it does when applied to pi 
distributions of single period outcomes. 

The formal statement of the second axiom is 


ming that choice should be based on the probability distribution of 
mes and not on the particular way they are generated, Axiom II is 
Walent to the assumption that the Rational Man should always take 
‘iiher than $2 whatever $3 and whatever the probability with which $3 
: < 4 Wh the choice of $1 or $2 is determined. 
Pot ee Q, if Ris any probability distribution, and ais any Birsion ofthethid strom 

aP+(1—a)R>aQ+(l—a)R. 
Let us illustrate this axiom in terms of a mythical example: as the 1)) there is a probability a # 1 such that 
Man is about to leave his native land to journey through the Real (PEGS oe 
the Keeper of Strategies presents him with two large, finely printed 3 
describing two complete strategies S! and $2. The Rational Man 
determines that he would prefer to follow S! than to follow 
Keeper of Strategies declares that he will spin a wheel marked 
numbers 1 through 100. If any number from 1 through 62 re 
Rational Man will be required to follow strategy 3, contained in 
book. If a number from 63 through 100 results, the Rational 
have his choice of $1 or $2. The one he chooses must be follo 
with and henceforth. The Keeper of Strategies asks the Rational 
choose between SI and $2, Remembering that he preferred S110 
Rational Man announces “Proceed with the spin, I will choose $1 
than $2" even before he opens the book $3 to examine its cont 


WM IIT: If P > @> R, then 


| here is a probability 6 #0 such that 
Q>bP+(1—dR. 


is, (( strategy SI is preferred to S2, and strategy S2 is preferred to $3, 
there is a probability (perhaps .999999) such that this large a chance of 
Wilead of S3 is preferred to S2 with certainty; while, on the other hand, 
16 4 probability (perhaps .0000000001) such that this small a chance 
‘*iaining St rather than S3 makes $2 preferable, 
Noms 1, I, and IH’ are not subject to proof. If there were more 
Hble principles from which these could be derived, then these more 
Nible principles should be used as axioms. All that is subject to 
Hintical proofis the relationship between the axioms and the expected 


250 


PORTFOLIO SELECTION 


utility maxim. The Rational Man would give equal assent to 


axioms and the maxim since each is a verbilization of the other. " 


Intermeorate Decisions, INcomecee Outcomes 


In the last chapter we saw that errors can be made in applying 
pected utility maxim if relevant intermediate decisions are omitted 
the analysis, or if the set of “outcomes” is incompletely or in 
spelled out. We now illustrate a corollary: even if the expected 
maxim applies to action-as-a-whole, it may not apply to some ii 
aspect of action, This is pertinent to portfolio selection, si 
selection of a portfolio is almost always one of a number of 
carried out by the individual investor, the investing institution, 
benefactor of the investing institution, 

Let us consider a hypothetical illustrative example: at time 
individual must choose a portfolio. At time 2 he must decide w 
Not to take possession of a factory. At time 3 his portf< 
assume to be his only source of funds) is worth either $2,000,000, $1, 
or $500,000. The probabilities of these three outcomes depend 
portfolio chosen, Also at time 3 the individual is required 
$1,000,000 for the factory, if he has taken possession. If unable 
he loses everything. At time 4 he gets $10,000,000 from the fact 
took possession and made payment at time 3. Thus the amou! 
he has at the end of period 4 may be summarized as follows: 


If he took possession and the value 
of the portfolio became 
then he would have 


$2,000,000 
11,000,000 


$1,000,000 
10,000,000 


If he did not take possession and 
the value of the portfolio became. 
then he would have . 


$2,000,000 
2,000,000 


$1,000,000 
1,000,000 


Let us assume that utility depends on final fortune as follows: 


8 O) 500,000 | smo 2,000,000 | 10,000,000 


u 0 z. 10 


3 | 5 


Let p,, py, p; be the probabilities that the portfolio will be 
million, one million, and five hundred thousand, respectivel 
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vidual takes possession of the factory, then the combinations of 
Pw Ps which yield the same utility U= U® satisfies the equation 
= 10.5p, + 10p, + Opy. If the individual does not take possession, 
combinations which yield U = U9 are those satisfying U° = 7p, + 
+ Ips. The factory is taken if 


10.5p, + 10p2> Tp + Spa + 3ps- 


bability combinations which yield the same utility are illustrated in 
ve 2, p, is on the horizontal axis; p, is on the vertical axis; ps is 


Figure 2. Iso-wility curves, 


implicitly by py —1—p,— pp The “indifference curve,” all of 
points have U = Uj, isa bent line. On one side of the break it has 
formula 


U; = 10.5p, + 10p,. 
the other side of the bend it has the formula 
Y= 7p. + Spa + 3ps 
= 414243. 
beak occurs at the heavy line whose formula is 


10.5p, + 0p, = 4p, + 2p2 +3 


6.5p, + Bp, = 3. 


Weighted sum u,p, + up, + sp, can represent the (non-linear) 
fences portrayed by the indifference curves of Figure 2. 
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In this case the Rational Man acts on the basis of the expected 
for action-as-a-whole, but not for portfolio selection by itself, 
investor has preferences among probability combinations (7;, pgs, 
portfolio returns; but these preferences are not in accord wil 
weighted sum Su,p,. Later we shall consider an example in whi 
investor cannot order portfolios by (p,. Pps) at all, but must co 
joint distribution which is incompletely reflected in the p,. 

If the individual takes the factory, the utility which he att 
various portfolio returns is: 


$ (in millions) 


4 


Another way of saying the same thing is that, in this case, the 
attached (o three possible returns (r) are given by the following uti 
money function: 


() 


Since there are only three possible returns, the quadratic w= % + 
can be made to fit exactly. In the next chapter we consider the al 
the quadratic to approximate a true utility curve when many outcos 
possible. The negative sign of ;(—= —12.33) follows from the co) 
of the utility function. The portfolio which maximizes the expec 
of u above is one with minimum variance for given expected return, 
the Rational Man needs to know only the set of portfolios which 
variance for various levels of expected return in order to find the 
which gives greatest utility—assuming that the factory has been tal 
the factory is not taken, then the utilities associated with the three 
outcomes are: 


— 165 4+ 3957 — 13.00r2. 


§ (in millions) 


uw 


| These are combinations of r and U indicated by the following W 
money function: 
(2) 


— 34 4+ 5.997 — 1,334, 


The portfolio which maximizes (2) once again gives smallest V 
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We of E. Hence, if he knew only the set of portfolios which minimizes 
for various E, the Rational Man could 


{s) determine the maximum expected utility if the factory were taken, 

{h) determine the maximum expected utility if the factory were not 
taken, and 

(0) compare these to determine whether or not the factory should be 


hen and, depending on this, which portfolio should be selected. 


{¢1 us amend our previous example so that there is a probability g that 
factory will have a small ($500,000) return in period 3, in addition to 
ge return in period 4. The return from the factory in period 3 is not 
wearily independent of that from the portfolio. The correlation 
en portfolio return and factory return may vary from portfolio to 


folio. The return to the individual if he takes the factory may be 
narized : 


Total Return, Depending on Factory Return 
and Portfolio Return 


Value of Portfol 
Factory Yield, su 


t=3 
2,000,000 | 1,000,000 500,000 
500,000 11,500,000 10,500,000 10,000,000 
0 11,000,000 | 10,000,000 0 


thie present case the decision-as-a-whole, including choice of portfolio 
Huying the factory, depends on the available joint distributions whose 
bilities p,, are defined: 


Vi 
Factory Yield, ‘alue of Portfolio 


=3 
‘ 2,000,000 1,000,000 500,000 
500,000 Pu Pas Pas 
0 Pa Pa Pas 


Wobabilities p,, pp, py of returns on the portfolio, and the probabilities 


‘We (1 — 9) of different returns from the factory, are sums of the p,,. 
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it SEQUENCE OF DECISIONS 


Consider an individual who follows a strategy S so as to maximize the 
ted value of a utility function U(C,,- - +, Cp). At time 4, the indi- 
wal has already consumed C?,-- -,C?;. Depending on his current 
Wwe and luck, he will consume C, and face a next wheel w,,,. For 
iilicity let us suppose that there are two possible C, (namely, Chand C?) 
two possible w,,, (namely, 1» and w%). The result of current choice 
ehance then is 


(Cw!) or (Cw) or (Cw!) or (C4 x2). 


Total] Pa Ps 1 


A given set of probabilities py, po, Pssg1s (1 — 4) are usually consiste} 
a variety of joint distributions (p,,). It is therefore not always po: 
determine which of two portfolios is more desirable when onl 
probabilities p,. p2, ps are specified. 

In the present example it is impossible to rank portfolios solely 
basis of their probabilities of returns (r). We must also consi 
correlations with other things. 

In contrast to the above examples, there are conditions under wh 
can ignore the details, or even the existence of some aspects of 
deciding on the details of others. The footnote presents an ill 
set of conditions which allow one to separate two areas of deci 


IY). 42 4x. and q,, respectively, be probabilities associated with each of 
Different bands on the current wheel have different g,, 

Hyppose that, in fact, (C*, w*) results. We may define a conditional 

ed utility given that: 


(8) CR, + +, C8, have already been consumed, 
{)) C'is the current consumption, w! is the next wheel to be spun, and 
[) Strategy S will continue to be followed. 


\ “conditional expected utility” is the expected value of final utility, 
{ing with a situation in which (a) and (b) hold, and following strategy S 
il the last period, 7. For the present discussion let 


! The following are sufficient but, presumably, not necessary conditions, 
thorough analysis of such conditions might prove interesting. 
Suppose that the strategies (5) among which an individual chooses are c0 


of two substrategies (A and B), A might be the strategy followed in earning: ' MUTI 
ing money; B might be the strategy followed during weekly walks in the Bees, (e COnarUOTA Cex Beco Uulity associates NUK a 


Similarly outcomes (O) can be split into two parts, , and O;. The first might , equal the conditional expected utility associated with (C’, w*); 

the amounts of goods and services bought; the second, the animals ai qual the conditional expected utility associated with (Ct, w); 

enjoyed while on strolls, The probability distribution of outcomes O, 4, (qual the conditional expected utility associated’ with (C*, w). 

choice of A; the probability distribution of outcomes O, depends on the Re i " ; se ; 

O, and O, may be correlated, depending on the pair of strategies 4 and B. @xpected utility associated with a band with probabilities p,, ps, Ps, Pais 
We shall assume that the set of opportunities available with respect to 4 

depend on the B chosen, and vice versa. The individual attempts to U= mp, + Wype + Maps + Pale 


expected value of a utility function U(Q,,Q,). We shall say that O, is j P enh 5 
independent of O, if, for any 1wo possible O, outcomes (eg.. Of and O38) the choice of bands from the current wheel is to be consistent with an 


vidual's preferences among probability distributions of O, if OF were imum strategy it must maximize the above expression. Similar con- 
same as his preferences among probability distributions if Of* were certain, vations apply when there are more than four (C, »') combinations. 
another way, Q, is preferentially independent of O, if U(O,, Of) and UO. fh the last chapter we considered an 

the same function of O, (except for choice of zero and unit). Hence U(Q,, Bi for a single period. . “Outcomes 


written : 
is include amounts of money won or lost, a new car, a new job. Such 


U(Oy, 0,) = Ul0,) + MOYT(O,). 

Special cases of the above function are these (wo cases Oy is independent of O, a5 well as vice versa. If U is of form (3), 

i wha wicarions 4 and B can be chosen without regard for the choice of the other. If Uis of form 
ef pea then 4 and B can be chosen separately, without regard for the other, if 0, and O, 
a 


‘Wilependent random variables. If U is of the first form but cannot be reduced to 
‘9nd oF third form, then O, may be chosen without regard to O,; but the converse 
Hot hold. 


3) U= (0) + T(0). 
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objects constitute opportunities for the future as well as items of im 
consumption, In the terminology of the present chapter, the outcor 
last chapter's single period analyses were combinations (C,, 44). 
discussion of the present section indicates that, when an individual 
mizes the expected value of a multi-period utility function, he 
maximizes the expected values of a sequence of single period utility 
tions, The form of each of these functions depends on past consui 
In other words, at each period ¢ the individual maximizes the ¢) 
value of 


Charter XIT 
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U = UC, war | Ch > 4 Chas 


where C?,- + -,C?, is already determined, while (C, »,.,) is a 
period outcome. 

In some cases the relation between a multi-period analysis and 
period analysis is of great convenience. In others this relation is of 
less practical value. . Business decisions in general may involve the 
of new machinery, the construction of new buildings, the accumula 
work in progress, and so on, Suppose that, at the beginning of a 
decisions are made which by the end of the period will result in items. 
and used machinery being on hand or on order, various amoul 
inventory being on hand or in process, and so on. If we could 
utility to every possible end-of-period combination of assets, we 
principle) derive the best decision for the beginning of the period. 
problem of attaching a utility to possible end-of-period combinal 
machinery, buildings, inventory, cash on hand, and so on, is gent 
difficult as the original problem of choosing a best beginning-of-pe 
of actions, Thus the relationship between multi-period and single 
utility maximization does little, in itself, to simplify the general p 


The present chapter considers rational choice when objective proba- 
{ities are not known for some or all contingencies. Our discussion will 
Wy heavily on the work of Ramsey [31], Savage [32].!_ We shall see that, 
{he individual follows principles similar to those expressed in the axioms 
Chapter X, he will act in the face of such uncertainty as if he attached 
sonal probabilities” to each contingency. He will maximize expected 
lily, using these personal probabilities when objective probabilities are 
Known, 
To some, the maxim based on personal probabilities may seem self- 
Went. Not all persons who have reflected on this subject, however, 
Wve come to the same conclusions. (See Keynes [35], Shackle [33], [34].) 
i} necessary to relate the maxim to more fundamental principles, as we 
for the expected utility maxim, 
Ii will be unnecessary to discuss the implications of personal probability 
i n lu(s for the diversification of portfolios, the purchase of insurance, and 
business decisions through time. bling. The implications of Chapter X for such matters still apply, 


In Chapter XIII we discuss problems involving liquid assets. N\ subjective probabilities taking the role which objective probabilities 
cases the relationship between single period and multi-period iyed there. 


functions can be of appreciable value. Hi would be 


nvenient to have some undeniable principle which told us 
is! probabilities to attach to possible outcomes, The obvious principle, 
Suggested in many forms for hundreds of years, is that all possibilities 
Mould be considered equally likely, Such a precept could be given two 
Weipretations, One is that at the current time every possibility is as 
Whely as every other: for example, it isas likely that A. T. & T. will pass a 
Widend as it is that some recently organized company will. Such an 
Himption is completely untenable. The other interpretation is that all 
= were equally probable at some point in the past, presumably 
i) man (or the particular individual) first began to collect information 


! Faimalso indebted to Norman Dalkey for illuminating conversations on the subject. 
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about the world, The proper beliefs to use at this time are those 
can be deduced from the information accumulated since this begil 
Although such a rule might be correct in principle,! it is imposs 
apply in practice. 

Rather our approach is to admit that, at least by this time, sul 
probabilities are not necessarily equal. We seek from the security 
‘opinions on a limited number of basic possibilities and derive from 
relevant implications. 


I{ the individual chooses decision d, and if.a possible Nature of the World 
Wi in fact true, then there is a probability distribution P of possible out- 

mes. For example, if the hypothesis were true that security returns are 
fiierated by the same (particular) probability distribution through time, 
‘jd if the individual selected the strategy of always consuming 1% of his 
‘eulth and investing the rest in equal amounts in every security, then there 
ould result a probability distribution of possible time patterns of con- 
Wmption C,. Some such distribution is associated with every combination 
W decision and strategy. In case both the strategy and the Nature of 
Tie World do not involve objective random variables, the probability 
Wyribution of outcomes contains only probabilities of zero and one. 

Decisions and Natures of the World roust be defined so that the choice 
4 Strategy does not alter the Nature of the World. For example, if the 
Hiividual were certain that his purchase or sale of a security would not 
‘ller its price, then the statements about possible natures could be in terms 

ious prices, On the other hand, if the individual entertains the idea 
wi his action might change prices, then the Nature of the World would 
Wve 0 be stated in terms of supply curves. In such a case the decision to 
i) might increase the price, but it would not change the supply curve. 
Wh this way we meet the requirement that the choice of strategy does not 
‘ler the true Nature of the World. 

We assume that the individual has a preference ordering among certain 
‘Weisions. Among the decisions which he mentally orders are, of course, 
{Woe which are open to him. In addition, we shall assume that he is 

ible of ordering some “what if it were so” decisions which may not in 
fii be open to him. Two such “imaginary” decisions are the following: 


Concerts 


Our analysis will use concepts such as “outcome,” “decisio! 
of the World.” The purpose of this section is to define and clari 
present use of these terms. 

Our analysis will concern itself directly with the problem of bel 
over time. An outcome will mean in this chapter, as it meant in 
XI, a time pattern of consumption. This time pattern, in & 
problems, may be a series of money expenditures (perhaps deflat 
when we are concerned with theory rather than with actual compul 
it may be a series of vectors of enjoyments. It will be conve 
assume that there is a finite number of such possible outcomes. 

We shall also assume that there is a finite (though perhaps ¢) 
large) number of hypotheses about the world, one and only one 
ig true, One hypothesis about the world might include the state! 
stock prices behave as if generated by a joint probability distributit 
remains the same through time. Another hypothesis might as 
this probability distribution changes through time in a particu! 
Each such hypothesis will be referred to as a possible Nature of the 
As these examples illustrate, we admit the possibility that objective 
variables and probability distributions may be used in the definiti 
Nature of the World. 

A decision, in the present context, is the choice of a complete 
which starts now and continues until “the end." This strategy 
necessarily specify “buy A. T. & T. on January 16, 1964." Rall 
say buy it then if such and such circumstances prevail; buy even 
it if other circumstances prevail; sell it if still other circumstances 
In short, the strategy is a rule which relates action at each time to 
tion available at that time. The strategy may even specify thal 
certain circumstances the next act is to depend on the flip of a coil 
entry in a random number table, Clearly only a Rational Mai 
contemplate a typical sirategy or, a fortiori. choose an optimal o) 


{s) “What if it were so that you could have a particular outcome with 
‘Wilainty? How would such an alternative rank with the other decisions 
sidered 2” 

{b) “What if there existed a random variable with probability p of 
yierating a ‘one’ versus a probability (I — p) of generating a ‘zero'?” 
/pose further that we consider the decision of first flipping or spinning 
{hy whatever) this random device and then embarking on strategy A if 1 is 
Poerated, or embarking on strategy B if zero is generated. How would 
jwh a random decision be ranked among the others? 


We are not assuming that a random device does in fact exist, but only 
{Pi the individual can order in his preference scale hypothetical decisions 
Wrolving such a device, 

We could state our axioms in terms of decisions, including those des- 
‘ibed in (a) and (b). These axioms would imply that the individual's 
Whavior is consistent with probability beliefs. They would not imply, 


* It raises certain technical difficulties when the number of possibilities is | 
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however, that his preferences are necessarily consistent with only 
of probability beliefs. Whether or not the probability beliefs 
uniquely determined depends on whether the set of decisions ord 
the preferences of the Rational Man is sufficiently large and varied, 
original set of decisions is not sufficient to uniquely determine the 
bilities, there is a variety of “suppose it were so” alternatives wl 
ranked with the rest, would serve to determine the subjective pro! 
uniquely. For purpose of “proofs” the simplest such set may 
structed as follows: let us assume that there is an outcome O, which 
ferred to an outcome O, (i.¢., all outcomes are not equally good), $| 
further that O, is at least as good as any other outcome, and Op is no 
than any other outcome. Let the decision g, be one which gives 


For (wo outcomes i, and iy (such that ¢;, is preferred to ¢,, ande,, > 
© ¢), for all e,) we assume that D includes matrices g(j = 1,° - +n), 
ich corresponds to obtaining j, if nature / is true and iy otherwise. The 
iments of the matrix g,, satisly 


1 iff =iandj = jy 
pus dl iim igand + jy 
0 otherwise. 
Fameces. Suppose that there are three possible outcomes and four possible 


Islures of the World. Then a decision or strategy is characterized by a matrix 
Nas 


3 3 1/2 0 13 0 
certainty if state of nature N, is true, and gives Op with certainty Hh ' Hs in 
other state is true. If the individual ranks g,, g..° * -.2, for us, ar ihe aj 
with the other decisions mentioned above, then we could w 1} (3 1/2. 


determine his personal probabilities. 

Let us summarize these concepts in a more formal manner. 
the number of possible outcomes and n the number of possible Nall 
the World. A decision d can be represented by a matrix with mr 
n columns. The entry p,, in the ith row and jth column of the 
the probability that the ‘th outcome will occur if the /th state of 
is true (and if the particular decision is taken). The jth column P, 
a matrix is the probability distribution of outcomes if the jth 
mature is true, Clearly the entries of every column must sum to 

The individual has a preference ordering among decisions din & 
sel D. This set includes the decisions ¢.¢),.* * +, €,,, where 
hypothetical decision which gives outcome i with certainty whate 
true state of nature. The matrix associated with e, has ones in the 
and zeros everywhere else, If decision d! and decision d? are in 
so is the decision of spinning a wheel and having a probability 
following d! and a probability of (| — a) of following d?. If pl, 
probability of outcome / if nature is true and decision | is take) 
i, is the corresponding probability for decision 2, then the prol 
the ith outcome if the wheel is spun is ap!, + (| —a)p?,. In other 
if we think of d? and d# as the m by n matrices of the p!, and pi, 
matrix of the new decision is exactly d? = ad" + (1 — ayd®. We 
then that, if the matrices d! and d?are in D, so is the matrix ad! + (1 
for any OSa< 1. By taking probability combinations of 
€1,@.° * >. @, WE Can arrive at any matrix (P) which has the same 
bility distribution P, in every column. This corresponds to the 
thetical) decision which gives the same probability distribution 
comes whatever the Nature of the World. 


‘} strategy with this matrix is followed, and if the third possible Nature of the 
vid is true, then each possible outcome is equally likely, A decision which 
Wilably results in outcome | has a matrix 


BA hy § 
4=1/0 00 0}. 
0000, 


ilecision which gives probabilities 1/4, 1/2, and 1/4, respectively, to the three 
eormes whatever the Nature of the World is 


1/4 1/4 1/4 1/4 
[r= W212 i} 


ta 1/4 1/4 1/6. 


{i this section we first present and then discuss four axioms of behavior 
‘Shen some or all probabilities may or may not be known, The first 
Wee of these axioms are essentially those used in the case of expected 
Wility, The fourth axiom says that, if decision | has a better probability 
Wiiibution than decision 2, whatever the Nature of the World, then 
yivion 1 is better than decision 2, The following section shows that the 
‘#wns imply that choice should be based on probability beliefs. 

Dis a set of matrices (decisions) including the matrices ¢, previously 
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‘Axion IV: Let P!" be the jth column of ind P® be the jth column of d*: 
P' is preferred to P¥ for every j, then d? > dt. 


defined. If d? and d* are in D, then ad? + (1 — a)d? is in D 
0S@<1. By asfinition, 


d'~d* means d'>d? and d?>d!; Axiom I says that the individual has a definite ordering of decisions in 

d'>ad? means d! 2 and not d?>d!. Set D and is subject to no fuzziness of perception. Part (c) asserts that 
Axiom |: are considering a case in which all outcomes are not equally desirable. 
(\ (d) says that for any two possible Natures, /, and j,, the individual can 
ll us whether he would, so to speak, rather bet on the truth of /, or of jp. 
45 (c) and (d) will provide the uniqueness of our probability beliefs. 
Axiom Il says that, if decision 1 is better than decision 2, then a 
decision | versus decision 3 is better than the same chance of decision 2 
hus decision 3. This axiom may be illustrated by the mythical example 
1 Axiom If in Chapter XI. The Keeper of Strategies has selected three 
ge books, each containing a complete strategy, which tells (among 
ier things) what to buy and sell under specific circumstances at various 
fimes.' Our Rational Man has inspected each of these volumes and has 
(ded that he prefers strategy | to strategy 2. The Keeper of Strategies 
ounces that he will spin a wheel with probability p of generating a 1, 
Wd probability (1 —p) of generating a zero. If a 1 appears, then 
egy 3 must be followed. If a zero appears, then our Man has his 
wee Of strategy | or strategy 2. Our Man is required to state which he 
Willchoose. Our aaiom asserts that he will choose I rather than 2, if the 
tunity arises, whatever the strategy 3 that he does not get and the 
Wobubility p with which he did not get it 
Axiom III is a “continuity axiom” similar to that used in Chapter X. 
Hean be given a Rational Man interpretation in the spirit of the last 

agraph. 
1s 1/3 Axiom IV says that, if, for every possible Nature of the World, d 
12>] 13 | lias a probability distribution which is at least as good as that of decision 
} then decision 1 is at least as good as decision 2. Alternatively, if there 
Wile objective probabilities attached to the possible Natures, then, given 
{he conditions stated, the first decision would be at least as good as the 
‘Weond—no matter what probabilities were attached to the various 
ible Natures, Axiom IV states that, under such circumstances, the 
decision is at least as good as the second, even when no objective 
babilities are attached to Natures. Savage (32] refers to this as a Sure 
Ning principle since, whatever the true Nature, the probability distribu- 
{0 associated with d! is at least as good as that associated with d*. 
Ay shown in the next section, if the individual acts according to Axioms 

1 1, 111, and IV, he then acts according to the expected-utility personal 


(a) For any two matrices d' and d? in D, either 
d'>d* or d*>d* (or both), 


(b) If d! > d? and d® > d®, then d' > a. 
(c) There is an i, and i, such that e,, > ¢,,. 
(d) For some particular pair i, and i, such that 


fy Bere, forall, 


+, m, where the elements of g,, salisty 


D contains g,, / 


1 iff=i and j= jo, 
Pu=4l ifisig and 7 Ajy 
0 otherwise, 
Axiom I: For d', d#,d?in D: If d' > d?, then 
ad! + (1 ~ a)d* > ad? + (1 = a)d3 for any a, 0 <a <I, 

Axiom IIL: Ifd!,d?, and d? are in D, and if d' > d? > d?, then the 

and a,0 <a <1, such that 
? ~ ad" + (1 — a)d?. 

We define the probability distribution P* (among outcomes) 
least as pood as P# (written P! > P2)if the matrix (P) all of whose 
equal P! is preferred to the matrix (P*) all of whose columns 
That is, 


means, by definition, that 
Wa 1/4 1/4 1/4 13 1/3 1/3 3 
V2 1/2 2 t2}S]1/3 3 1/3 1/3). 
4 1/4 a 4/4 73 1/3 1/3 43 


This definition is required since we have not, heretofore, d 
postulated an ordering of the vectors 


‘They may even contain tables of random numbers to be used in agricultural 
‘Hifriments or in friendly poker games to fool an opponent occasionally. 
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Axiom 1 implies that the individual has an ordering for a set F of vectors 


the 7, are non-negative and sum to 1) such that the matrices d Ye (U,,°+ + U,). Since the utility of i, is zero, all U; 20. The 
be ordered by finition of D implies that F contains 
Be(Bp.4d- the vectors (1, 1,- ++, 1) and (0,0,-+ +0) and 


Or, in matrix notation, 


all the unit vectors (0,- - ,0, 1,0,+ + +, 0). 
abo implies that, if V4 and ¥? are in F, so is 


av? +(1—a)¥? for 0<a<i. 


v= (uy 


+5 Mad(Pi)) 


Tn, 


Axioms I and II imply corresponding statements with vectors ¥, 72, 
il V% replacing decisions d', d?, and d3 
Axiom 1V implies that, if 


describes the preferences among decisions d. 


DERIVATION V=(U,--+,U,) and V2 = (UI, > +, UY) 


In this section we show that Axioms I, II, III, and IV imply the 
of both utility and personal probability, The present proof relies 
on results already derived in Chapter X, pages 236 through 242 

First consider decisions e, and (P) whose matrices have the 
probability distribution in every column. These decisions give t 
probability distribution of final outcomes whatever the Nature 
World. Axioms, II, and III of the last section, applied to such d 
are equivalent to Axioms I, II, and HL of Chapter X. These axioi 
saw in the latter chapter, imply the existence of expected utility; | 
there exist numbers u;,* ~ °, u,, such that, if P= pe, +> -* + pal 


UP) = pu, 


indicates the ranking of P among such probability distributions. 4 
Axiom LV implies that, if P” is exactly as good as P® for all j, £1) (0) = 0, 
is exactly as good as d®. In particular, if U" = U® for all j (whe He Dis Me Reo NACI and w 3 
the utility associated with the jth column of decision (i)), then d! is W flav +0 = ayv4] = af) + (1 — av?) 
as good asd. Thus decisions can be ordered according to their OSa<i. 

utilities (U,, Up, * * “, Up), where U; is the expected utility of the 
if the jth Nature of the World is true. To. prove the existence of 
probabilities we must show that the vectors (U;,- ' +, U,) can be 
by a linear function 


in F,and U, > U? for all i, then Vis at least as good as V+. 
We show below that Axioms I, JI, I{1, and IV imply that the vectors 


V=(U,->+,U,) 


i) be ordered by a linear function 
Driv, 


Sr,=1 and 7, >0. 


here 


{Nie course of our argument we shall meet a function /(V), defined for all 
in F, with the following properties 


# shall now show 
AV) =f. U,) = Ta, 
wwitably chosen ,. If 1 > 6 >, then 
SOY) = flbV + (1 — 6) = of ¥). 
land bY is in F, then 


=e 
where the m, are non-negative and sum to 1. 
We may assume that outcome i, has utility equal to 1, and jy hat 
equal to 0 (Axiom I, part c). With these conventions, let us st bad 
immediate implications of Axioms I, Il, III, and IV for the orderii SOV) = BV) for f{(1/o\V] = (1/byf(bV). 


vectors V = (U,,- - +, U,). He for b > Oand Vin F: f(b) = bF(V) if BV i 


Fo Leta: +e, 


266 PORTFOLIO SELECTION 


be then unit vectors (0,- + -,0,1,0,---,0). Any vector V =(U, Where 
with all U; > 0, can be expressed in terms of the e, as follows: 
VO = @, 
VO = pve + (1 — pel, 
VEO = pVid + (1 — pet, 
Vim) = p,Vim-l) + (1 — pe", 
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Pa(i,l,-- +1) and ¥=(0,0,-+-,0)=0. 
‘This follows from Axiom IV. 
For every V in F we define f{V) to be the number f such that 

View fP (l= f)¥, 
larly (7) = Land (¥) = 0. 

§, If 0<a@<1 and V?, vare in F, then 
Sla¥* + (1 = a4 = af(V4) + (1 — affV9), 

Proof as in Chapter X. 
6.1V=(U,- ++, U,)isin F, 

AV) = LU fle) 

= 2, 

Where m, = fle,), as shown earlier in this section. 
1 Sm = 1, since 1 = f(P) = Fn, 
4, m, =0 follows from Axiom IV. 


and finally 


V=(YU)V"; 


for suitably chosen pj: 0<p,<1. This follows from the 
Chapter X, page 236, which implies that V' can be expressed as 


Repeatedly applying 
Mev + (1 = ped = psfV) + (1 = piled) 
and finally 


ALU: v") = (Luv), 


we find that 


Narune oF tHe 1, | 


AV) = My Ue) : ; 
= SU,fle) Ih the last section we argued that decisions may be ordered by their | 
Su; ‘ ‘Welors of utilities Y= (U;,-- -, U,), where U; is the expected utility 
= Un, 


Mhociated with a decision if the ith Nature is true. We saw that, if the | 
Wilvidual’s choice is consistent with the axioms of the preceding section, 
#) his ordering of the V can be described by a function 


v= YaU, where Sm, = 1,7, >0. 
‘The present section considers the justifications for referring to the 7, as 
"probabil ities."" We shall see that: (a) when objective probabilities are 
wociated with possible Natures, the n, equal these objective probabilities; 
{h) when some objective probabilities are known, the subjective probabilities 
and the objective probabilities mix “on a par” with each other; (c) the 


Hbjective probabilities obey the same arithmetic relations as do objective 
Pobabilities. 


The proof that the V in F can be ordered by a weighted sum closely 
the derivation of expected utility in Chapter X. 


1. If V4, 2, are in F, and V~ V2, then 
ai +(l—aVi~V!, if O<a<l, 


Proof as in Chapter X. 
2. If V4, V2, and V2 are in F, and if V1 ~ V2, then 


aV¥ + (1 —a)¥3~av?+ (1 —a)¥® for O<a<l. 


Proof as in Chapter X. 
3. IFV', F4,and V are in F,if V1 > V# > V3 and if V4> VU 
is one and only one number (a) such that 


Vt ~aV? + (1 —a)¥3. 


{#) When objective probabilities are known for all contingencies, the 
Bational Man, following the axioms, maximizes expected utility, In 
Hiicular, if objective probabilities are known for the various possible 
islures, the weights in 


Proof as in Chapter X. 
4. For any Vin F we have 


Vevey, 


b= 3nU, 
uld be these objective probabilities, 
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(b) Suppose it is known that one Nature of the World, 4, B, or 
truc. Suppose it is further known that, if either Bor C is true, the 
50-50 chance that it is the former rather than the latter. Since there! 
objective probability attached to A versus (B or C), we cannot attal 
objective probability to either B or C individually. However, if subj 
probabilities are to mix “on a par” with objective probabilities, we 
have my = my = (1/2)m4 ory Is this necessarily true for con: 
rational behavior ? 


it perfectly satisfactory in the case of subjective probabilities as well. In 

eral, we may define the probal of a set of possible Natures to be 
4 sum of their probabilities! Whenever utility is the same for all 
Members of the set, we may say that a wi is attached to the statement 
one of the Natures in this set is true,” and we may multiply this utility 
‘Wy the “probability of the set,”’ as illustrated above. 


When there are a finite number of possibilities, all the mathematical 
Welations among probabilities follow from the facts that: (a) the proba- 
Wlity of a set of (mutually exclusive) possibilities is the sum of the 
frobabil ies of the individual possibilities; and (b) the proba lity 
‘Wociated with the set of all possibilities is 1. All other relationships 
‘ong probabilities follow from these properties and from definitions 
Wf concepts of interest. If definitions are made for probability beliefs 


‘hich parallel those for objective probabilities, then similar relationships 
Iollow. 


The present situation may be analyzed in two different ways: 
one hand, we can say that there are three possible Natures, A, B, 
and that each decision is characterized by three probability distribu 
P4,Py, Pc. Or, on the other hand, we could say that there are two 
of nature, A and D, where D represents either 8 or C being true, 
latter form of analysis each decision is represented by two distribi 
Py and Pp, where Pp = (1/2)Py +(1/2)Pe. The Rational 
preferences are such that the way we analyze the situation d 
influence his ranking of alternatives. Therefore 14, 7, %™ must 
as to produce the same orderings in both modes of analysis. Cons 
the latter form of analysis, we find that there are numbers 7, and my 
that preferences among decisions are described by 


Vim a Ug + my = 74U 4 + mol(l/2Up + (U2)Uc} 
Uy +N 2)epN ay + (2p U; 


Therefore we must have mp = me = (1/2)mp. 

A different kind of situation containing both objective and sul 
Probabilities is one in which there is a 50-50 chance either that 4 will 
or that(B or C) will occur. However, no objective probabilities are 
for B versus C given that (B or C) is true. In this case, if subjecti 
objective probabilities are to mix “Yon a par," we must have 7), + 7 
Suppose that this were not so; then the utility Tala + yl He 
attached to the utility vector (0, 1, 1) would not equal 1/2. (0, 1, 1) 
not be considered exactly as good as (1/20, 0, 0) + (1/2), 1, I). 
this cannot be so since these both represent the same objective prol 
distribution. For, by hypothesis, the decision (0, 1, 1) has a 50-50 
of receiving U = 1, as does the decision (1/2X0, 0, 0) + (1/2)(1, 
Thus we must have rg + x = 1/2 

(c) Suppose there are three possible Natures, 4, B, and C. § 
further that we are considering a set of decisions all of which have U, 
Then V = aU, + 7_Uy + teUe = Uglty + 1m) + 1eUe. I 
were objective probabilities, 7, + 7, would be the probability that ) Since one and only one Nature is possible, these alternatives are “mutually ex- 


Aor Bis true, But we see from the above equation that this interp\ "In the case of objective probabilities the probability of any of several mutually 
ivsive events is the sum of their probabilities. 


posteriori BELiEEs 


As information accumulates, a priori beliefs become a posteriori beliefs. 
shall illustrate, by means of a two-period example, how information 
lects subjective belief. (A more complete discussion of this subject will 
B found in (32}.) 
Suppose that, at the beginning of the first period, an investor has $1.00. 
lf may either invest this'll in cash or all in a particular security, If 
ested in cash, he has $1.00 at the end of the period with certainty. 
invested in the security, the $1.00 becomes either $.90 or $1.20. At the 
pinning of the second period, the investor must again place all his funds 
fiiher cash or the security. Again there is inevitably a zero return on 
yh, while the return on the security will be cither —.1 per dollar invested, 
4.2 per dollar invested. 
There are four Natures of the World: (1) the security will yield first 
~ |and then ~.1; (2) it will yield —.1 and then +.2; (3) it will yield 
+ Athen —.1; and (4) it will yield +.2 and then +.2. There are eight 
an strategies which we assume the investor can employ; two of these 
(1) invest in cash in both the first and second periods; (2) invest in 
‘lin the first period; invest in cash in the second period if the return in 
Wy firstis —.1, but invest in the security in the second if the return was 
2 
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The four possible Natures of the World and the eight possible st 
are listed in Table 1. CSC represents the strategy which first in 
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Let us consider the conditions under which our individual will prefer 
‘itmtegy 5 to strategy 6. In order for this to be so, we must have 


3m, + 3m, + Ins + Tm > 3m, + 3m + 6m + Bry 


Taste | 
DOLLAR AND UTILITY OUTCOMES 


This will be so if and only if 


Ins + Tm > 61 + Bry 


Strategy Nature 


Which in turn is true if and only if 


Period 2, if (CF »| ney 12)| (1.2, 9) a 
Period} 


) pp gta us “th 


>6 +8 
ie nim mem mtm mtm, 


ry + 7) may be defined, analogous to the corresponding definition for 


Shelf kee he cenit Mere: 5 qi 5 ‘Whjective probabilities, as the conditional probability that Nature of the 
DWE GH Gr Sar) Al 00s: teROd eS 9° 3 World 3 is true given that —.1 is the return on the security in the first 
er pea ake 90 3 | 120 7 | 100 5 Hiod; and similarly for /(m, + 7). If there is a —.1 return in period 
Sed fede STi er 90 3 a3 8 | then both a person following strategy 5 and one following strategy 6 
eaeMeeclee rude eacee LA gene ld find themselves in the same position at the beginning of period 2, 
ER HBA Bra fatanivelin eat nome Hoth would have $.90 plus the information that a .9 return had occurred in 
BE [SalfeS ll oSi | peli fhe n |) PORRG 


fod 1, Inequality (1) says that the individual's choice at this point 
= as to maximize utility given the a posteriori probabilities based on 
ii evidence thus far. 
Similar conclusions are reached if we compare strategies 1 and 2 ifa 
J return occurs in the first period; or strategies 2 and 4 if a —.1 return 
us in the first period. Similar relationships between a priori proba- 
Wiies, information, a posteriori probabilities, and utility maximizations 
lr if we consider choices involving more than two periods. 


cash; then invests in the security if return was —.1, in cash if t 
was +.2, (9, 1.2)is the Nature in which return is first —.1 and 
To each combination of strategy and Nature of the World we 
ate a final money outcome: for example, if the individual uses s 
and Nature of the World 2 turns out to be correct, then the fins 
outcome is $1.20. This follows from the fact that the individual 
in cash in the first period, invested in the security in the seco! 
(because of the —.] return in the first period), and made $1.20 
invested in the second. The possible final money outcomes are} 
1.00, 1.08, 1.20, and 1.44. We shall arbitrarily assume that 
function of final money outcome is: 


OWABILITY BELIEFS AND THE INDIVIDUAL 


Appose that an individual attaches subjective probabilities .6, .2, and 
| fespectively, to the possibility that a coin has 2 probability of .5, 4, 
i) 6 of coming up heads, The individual believes, however, that, what- 
/ the probability of a head, it will remain the same for ever. He further 
lieves that each flip of the coin is independent of every other Aip (in 
His of the true objective probabilities). On the basis of these beliefs, he 
aches a .5 probability to the statement, “A head will appear on the first 
pol the coin.” Before the coin is tossed he also attaches a .5 probability 
the statement, “A head will appear on the ith toss.” At first sight, it 
ht seem that, since he attaches @ .5 a priori probability to a head appear- 
on cach flip, the law of large numbers must force him to conclude that 
the long run heads will almost certainly appear approximately 50% of 


The utility as well as the money outcome attached to each com 
strategy and Nature of the World is presented in Table 1. 
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the time. Yet our initial assumption was that he attached only 
probability to this statement. 

There is no inconsistency here: the probability belief he attaches 
statement, “Heads will appear on the second flip,” is not independe 
what happens on the first flip. In terms of probability beliefs, the 
comes of the flips are correlated to such an extent that the average 
large number of flips has a variance substantially greater than zero, 


Of predictions and constant education without substantial biases on the 
Whole. 

2, The connection between objective and subjective probabilities is 
quite close. We noted that they mixed on a par with each other in the 
tlculation of expected utility. The discussion of this section indicates 
‘Mother connection. To assert that some physical experiment has a .6 
robability of producing a result A is to assert that, if this physical experi- 
Similar remarks apply if a large number of coins are to be ‘Went is carried out and if a large number of other physically independent 
simultaneously, The individual may believe that every coin has the: ‘periments (of the same or different kinds) all with a probability of .6 of 
true probability of appearing heads. He may also believe that, in te JWoducing some particular (though perhaps different) result A are also 
the true probabilities, each flip is independent of every other. Finé fartied out, then the relative frequency of A will almost certainly be .6. 
may attach a subjective probability of .5 to the statement, Winilarly, to assert that a .6 personal probability is associated with an 
appear on the ith coin.” Yet he may not be sure that the propo! is to assert the belief that it is virtually certain that the relative 
heads will be close to .5. The reason, as in the previous case, is t frequency of correct predictions among a set including this and a large 
conditional probability he attaches to the statement, mber of other psychologically independent events is Thus personal 
turns up heads,” is not independent of the statement, Probabilities and subjective probabilities are connected via the notion of 
through 99 all turn up tails.”” Wiitive frequency in the long run. 

He may similarly attach an a priori probability to the statement 
estimating security prices he either generally estimates too high or 
ally estimates too low. Thus part of the subjective correlation 
estimates may be due to possibilities about the individual as 
possibilities concerning the world 

Suppose that we note an event to which the individual attae 
probability of occurrence. We wait until the prediction proves 
either true or false, and then find another event to which the in 
attaches a .6 personal probability—given the information then & 
including the fact that the first prediction was or was not true, We 
this process, at each stage noting a prediction to which the i 
attaches a .6 probability.! By construction the probability belief 
ith prediction will be correct is independent of the outcomes 
preceding predictions. The variance (in terms of probability 
associated with the average number of correct predictions is 
zero if the number of repetitions is large. 


Two morals can be drawn from the above discussion: 


1. The existence of personal probabilities does not necessarily i 
as of the moment, the individual is positive that his beliefs are: 
beliefs.” He may admit the possibility that he currently is either 
overoptimistic or always overpessimistic, or in some other way 
biased judgment. However. the idea of probability beliefs does 
belief in an ability to learn with time and experience, to end a 


1 This procedure assumes that such a prediction can be found at each sti 
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the portfolio at the beginning of period + 1 (i.e., at the end of period 1), 
then, under our present assumptions, the single period utility function 


U = UCr Wi Cy Cees 6 Cad 


fan be written 


Cuapter XII UV UCr yen GCs 4 Cae 
Although it may not be apparent at first glance, the second expression 
Wepresents a substantial simplification of the first. w,, identifies a 
wheel” which may differ from other wheels in many respects, such as the 
HMounts of various illiquid assets held. y,,,, on the other hand, is but a 
Yingle number—the value of the portfolio. 

Under our present assumptions, therefore, the investor maximizes a 
Mingle stage utility L(C,, ¥,,,) in which utility U depends only on current 
fonsumption C, and the value of the portfolio y,,, at the end of the period. 
The form of the current utility function U(C,. ¥%41) may depend on past 
fonsumption C,, C,,* - +, C)_y. His present opportunities with respect 
Wo C, and y,,, are determined by his current wealth y,. 

The following three sections discuss three types of portfolio analysis 
hich, in principle, could be performed under the assumed conditions. 
The first of these analyses provides an exact solution to the problem of 
Portfolio selection through time. It may be thought of as a procedure 
hich a Rational Man might employ in determining his course of action. 
This exact method of analysis is well beyond the capabilities of real men 
‘id real machines. Its value to us is that it helps complete our picture of 
falional action by which more feasible modes of analysis may be judged. 

The second mode of analysis has the person reproduce his utility function 
WE) yi.) and has a computing machine find the portfolio which maxi- 
‘Wites the expected value of utility thus described. The third mode of 

lslysis is the efficient set approach generally employed in this monograph. 
We consider the extent to which each approach provides an approximation 
Wf the exact solution of the rational portfolio selection problem. 

Fach of the three types of analysis could, in principle, be adopted for 
‘Hachine calculations. In each case there would be a division of labor 
Miween men and machine, Some information would be supplied by 
Human judgment; some calculations would be accomplished by the 
Jomputer. For each type of analysis we consider the reasonableness of 
WW division of labor. Can the individual supply the required information? 
140 the machine perform the required calculations? 
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ASSUMPTIONS 


The first sections of this chapter consider portfolio selection whe! 
following three conditions are satisfied : 


(1) the investor owns only liquid assets; 

(2) he maximizes the expected value of U(C,, Cy,* > +, Cp), whe! 
the money value of consumption during the ith period (C, could, 4 
natively, represent money expenditure deflated by a cost of living 

(3) the set of available probability distributions of returns from 
folios remains the same through time (if C, is deflated consumptio 
it is “real return,” taking into account changes in price level 
probability distribution is assumed constant), 


Later we consider modifications of these assumptions. 
An asset is “perfectly liquid” if 


(a) the price at which it can be sold, at a particular time, always 
the price at which it can be bought at that time; and 
(b) any amount can be bought or sold at this price. 


Even though securities are not “perfectly liquid,” they are sul 
liquid for an analysis based on liquidity to be instructive. The 
illiquidities, among other things, are considered later. 

Condition (3) does not imply that the same security offers 1 
Opportunities at all times, The new and promising firm of today 
well-established or a defunct firm tomorrow—at which time the 
being “new and promising” is taken over by other firms. Theassii 
made at first and modified later, is that the opportunities from the 
as a whole remain constant. 

Perfectly liquid assets may be converted into cash, and cash 
converted into liquid assets without loss, If available probabil 
tributions remain the same through time, the investor's op} 
depend only on the value of his portfolio. If we let y,41 be the 
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Ti Dynamic PROGRAMMING ANALYSIS 


Techniques for obtaining best strategies over time are studied in the 
Wil of dynamic programming [17]. Without going into computational 
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details, let us consider how the Rational Man could apply such tech 
to derive his optimum strategy from 


(a) his utility function U(C,, Cy, Cy,* +) Cr), and ; 
(b) the “wheels,” described in the last chapter, which characteri 
opportunities over time. 


Wiviving at the state (C,, Cp.° «+, Cp.g; pq) and following the optimum 
{rategy from that point forward. The derivation of the best action and 
Wility associated with the state (Cy, Cp, © +, Cp..g; Wp_y) uses the utilities 
sociated with the next period’s state U(C,,* * +, Cp_4; Wp) as determined 
ly the previous step in the calculation. 
Continuing backward through time, for each possible state at time 
{7 = 2) we can find a best action and a utility U(C,, > + +, Cpgi pz) 
period and ending with the first. The best action is derived for Whociated with pursuing an optimum strategy after arriving at state 
possible circumstance in the last period. With this information: MO) +s Cr. From this we can compute U(C,,- ° 
Possible, in principle, to derive the best solution for each possible Ny.) associated with arriving at any possible state at time (7 — 
stance in the next to last period. From this it is possible, in t Wisuing an optimum strategy thereafter, This procedure continues 
derive the best possible action for cach possible situation in the a, through time until = 1. 
from last period; and so on, until the proper action for the first per Thus, for each time s, the computation derives 
determined. The proper action in the first period, plus the proper 
for each possible situation in succeeding periods, constitute the 
to the dynamic programming problem. These rules of action 
circumstances are the strategy which the Rational Man would f 
We define the stare at the beginning of time f to be consumption. 
plus the wheel that characterizes available opportunities. The # 
time #, in other words, is represented by the vector (Cy Cy, 
C.iw). Attime ¢ = 1, the state is simply (w,). At time T the 
(Cy, Cass Cpeyj wz). It is convenient to define the state 
(T +1) to be the vector of consumptions (C,, Cs Ca* «Gj 
number of different possible states which could occur at time ¢ 
number of possible combinations of past consumptions C,, Cy, 
and present opportunities »,. 
The utility function U(C,, C,, Cy* > +, Cp) attaches a utility 
time pattern of consumption. Thatis, it attaches a utility to each 
state at time T+ 1. For each possible state at time 7—for each 
combination (C,, Cy, ~~ -, Cp; wz)—there is an action, a choice 
on wy, with greatest expected value of U(C,, Cy** “Cr). The 
strategy specifies that under the conditions (C,, Cs, 
proper choice is this band with maximum expected U(C;, Cy, 
The expected utility associated with choosing the proper band of 
be referred to as U(Cy Cy - +, Cpai wr). It is the ex} 
associated with arriving at the state (Cy, Cy, * *) Cr. 
the proper action at that point. We can, in principle, att 
utility to cach possible state at time 7. » in general, 
For each state (C,, C,° °°, Cp2j wr) at time (T— 1), U 
choice of band on wheel wy, which maximizes the expected 
U(Cy, Cy + *+ Cras Wr) This maximum expected value may 
by UC), Cy, 2) Wy). It is the expected utility as 


The general approach is backward through time, starting with 1 


{1) the best action for each state, and 
{b) the utility associated with arriving at the state and following an 
imal strategy from that time forward. 


Ne choices in (a) form the optimum strategy. The utility functions in 
) are needed to work backward through time to the complete -best 
‘stegy. (b) is essentially the single stage utility function discussed 
lier. 
The difficulty of actually performing this computation depends on the 
ire of the “wheels” and the utility function. The general dynamic 
mming problem is orders of magnitude beyond anything we can 
eI to compute now or in the foreseeable future. Even under the 
filitions enumerated in the last section, whereby w, is completely 
acterized by the value of portfolio y,, computation of a best strategy 
lugh time, for an arbitrary utility function, must be left for the Rational 
i) and the Perfect Computing Machine. It is far beyond the available 
foreseeable computers, 
Porticular functions U(C,, Cy, Cs,* ++, Cp) simplify the dynamic 
ramming computation. Suppose, for example, that the utility 
Hion is one of the following forms: 


U = log C, + logC, +- + + log Cy, 


U=VE+(IVG +++ +(.9)'VCp 


U = UC) + ULC) + + + + Us(Cy), 


{¢ U,(C,) is a function which associates a utility U, to each value of C,. 
tise cases it can be shown that the proper choice at time ¢ does not 
fd on previous consumption. It is not necessary to compute a best 
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choice for each combination (Cy, Cy° * 4) C).13 ys), It is sufficie for a function which adequately describes the dependence of present 
compute one best choice for each value of y,. This itself can be a si ‘shoices on past consumption, the computation is probably infeasible. 

task. Depending on the nature of the utility functions. the compul 
of the best portfolio for a single y, may be greater than required 
entire efficient set analysis, Multiplying this by the number of y) 
from | to T, we see that we can easily have computing costs far bey. 
possible gain from the analysis. If the utility function were 
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The value of the dynamic programming analysis does not lie in its immedi- 
‘Ae application to the selection of portfolios. It lies, rather, in the insight 
ji upplies concerning optimum strategies and how, in principle, they could 
te computed. It emphasizes the use of the single period utility function. 
Hi shows how the sequence of single period utility functions could, in 
U = log C, + 9 log C, + 81 log Cy ++ > + + (.9)7 Flog Cy Winciple, be derived by starting with the final utility of consumption 


a Cr WC. Ca + +.Gp) and working backward through time, associating 
~ Slog [max (. <)] ae = (509) Hog {max (1 Millities with intermediate situations. 


such as discussed on page 246, the best action at time ¢ would de Tit Sincte Time Perion Uriuity ANALysis 
C,-: 48 well as on y,. We would have to compute a best action ft 
combination of y, and C,,. With this, or a slightly more com} 
function, we pass from the realm of uneconomical to the realm of cul 
infeasible problems. 

For some types of problems, dynamic programming has proved 
practical computing procedure. Certain problems of machinery 
ment and inventory control, for example, lend themselves natu 
conveniently to the dynamic programming approach. There i 
illustrative portfolio selection problems where dynamic pi 
techniques can be applied. Suppose, for example, we wish to see 
rational investor would shift between bonds and stocks if a 
picture of the world were correct. For this purpose it might be 
to distinguish only two or three “securities,” for example, bonds, 
and cash, rather than include the many individual securities of w 
portfolio selection problem. It might also be permissible to use 
what simplified utility function, rather than painstakingly seek # 
function, Under such conditions current dynamic programmi 
dures could produce an optimum solution 

For the actual choice of portfolio, however, the dynamic pr 
techniques cannot be used. They require too much both from 
machine: 


As of time r the Rational Man maximizes the expected value of a single 

Piiod utility fanction U(C,, w,,,). The exact shape of this function may 
‘Hyend on previous consumption. When the choice situation meets the 
{ice conditions presented in the first section of this chapter—when only 
Wiuid assets are held, when C, refers to money consumption, and when 
‘PHiluble probability distributions are the same through time—the wheel 
+ may be characterized by y,.,, the value of the portfolio at the begin- 
Hip of time r+ 1, 
Suppose that a Rational Man told us his current U(C,, ¥.41). Wecould 
pute the value of C, and the combination of securities which maximizes 
fxpected value, It is not necessary to know how this single period 
Hlily function was influenced by past consumption. As long as we know 
eurrent U(C), ¥4,) We can, in principle, compute the best current 
ive of portfolio. This suggests a possible division of labor between 
| men and real machines. The investor could specify a function 
© thas); the machine could compute the value of C and the proper 
we of portfolio to maximize the expected value of utility thus defined. 
Nather than require the investor to present us with a utility function 
1 /), we could ask him a series of questions whose answers would imply 
{y). The questions could be of the following types: 


{)) suppose that consumption equals C®, what probability combination 
sealths y? and y?is exactly as good as having wealth yf with certainty; 


1. From the investor they require a utility function U(C,, Gy, © 
In the next section we see that it is no small task to derive @ 
single period utility function U(Cy, Yi41) by questioning the in 
attempt to derive a representative utility function for consumpll 
time, if feasible at all, is nothing short of a major research pi 

2. Even with the simplest of utility functions, the require 
dynamic programming computation are far beyond economic j 


1) what probability combination of (C°, y°) and (C}, y') is exactly as 
as (C% y)? 


Would not be necessary to ask such questions for all possible combina- 
sof Candy. Ifthe investor answered with perfect accuracy, it would 
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be possible to construct his utility function from the answers to (m ti 
minus 2 suitably chosen questions (where m represents the num| 
possible values of C, and n represents the number of possible values 
The need to check the internal consistency of the human respoi 
increases the number of questions required. The acceptability 
approximate rather than an exact portrayal of U(C, y) reduces the nul 
of questions required. 

In some cases it is possible to separate the choice of portfolio from 
choice of C. This is so if either 
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With existing methods." The computing costs involved in (approximately) 
‘Maximizing the expected value of concave U(R) will generally be greater 
than those involved in performing an efficient set analysis based on mean 
Wd semi-variance. The latter costs, in turn, are greater than those 
jnvolved in performing an efficient sct analysis based on mean and variance. 
Although not negligible, the computing costs involved in maximizing 
#apected UCR) are not prohibitive? 
The advantage of a utility function, as compared with an efficient set 
‘alysis, is that the utility function is a specially constructed representation 
Of the investor's willingness to bear risk. If derived with care, it is a more 
fecurate portrayal of the investor's attitude toward losses and gains than 
ive (he general purpose utility functions implicitly assumed in the use of an 
Micient set analysis. 
In addition to its greater computation cost, the disadvantages of the 
Wility analysis approach concern matters of convenience and appeal to 
ie investor. The investor, the investment manager, or someone else who 
Pin speak definitively concerning the objectives of the investor must answer 
estions concerning preferences among probability distributions of return. 
HV the respondent has little or no experience with matters such as the theory 
#/ rational behavior, he may consider such questioning a queer way to go 
shout selecting a portfolio. Afterward, when the optimum portfolio is 
nally produced, we can tell the investor only that this portfolio is best 
because it best suits his utility function. This approach will probably have 
ps immediate meaning and intuitive appeal for him than an analysis in 
Which the investor is shown combinations of “risk” and “return” and is 
then asked to pick carefully the combination which best suits his needs. 
Thoosing a combination of risk and return is a more natural procedure 
‘hin expressing attitudes toward risk in terms of a utility function and then 
Waving the choice to a machine, 
Ii seems to me that the more convenient, more natural efficient set type 
alysis is a logical first step in the formal analysis of portfolios. When 


(a) the proper value of consumption C does not depend on oj 
tunities available from investments, or 

(>) the questions “What probability combination of wealths y! a 
is exactly as good as wealth y° with certainty?” always have the 
answer whatever the value of C, 


If one or both of the above conditions holds, then the portfolio ef 
chosen on the basis of a function associating a utility to each value of 
wealth yi. Alternatively, since y,,. = (1 + R)y, (where R is rel 
the portfolio and y, is present wealth), the portfolio can be chosen 
basis of a function U(R) associating a utility U to cach value of po 
return R, 

A utility function U(R) can be determined from the answers to qui 
such as “What probability combination of returns R® and R} are 
as good as return R* with certainty?" If answers were pel 
accurate, only (m — 2) such questions would be required—substay 
less than the [(m times n) minus two] questions required to de 
uc. y). L 

A division of labor in which the investor supplies U(R) and the mi 
computes the portfolio which maximizes its expected valuc is fensil 
both man and machine. 

Human requirements are reasonable; perhaps twenty or so qi 
concerning preferences among probability combinations can provis 
establish the general shape of the utility curve. An examination 1 One upproach is to approximate the curve by quadratic segments, and then use 
provisional curve and some of its implications can produce refing WMputing procedures similar to those described in Chapter 1X for minimizing semi 
which better represent the investor's attitude toward risk. At most, lance, 
hours" time is required. If the utility function is a committee prod) * The cost associated with the approach of the preceding footnote depends on the 
behalf of an investment institution, somewhat more time may be ni ies iss peer eo ba oe parpeccunietl oer fe 
Subjects generally find the derivation of their utility function an inlg + on wee Hist portfolio Np eiaeaen eee tally Tear bel TomeRiSY 
activity. Jy product of an efficient computation based on mean and variance. If two segments 

Machine requirements are not excessive if the utility function | ‘1 fequited, then essentially the procedures for an ordinary efficient set analysis based 
conservative, concave variety. It is unnecessary and uneconomil HH Wean and semi-variance are required. If more segments are used, the computing 
Tequirean “exact” solution. Good approximate solutions can be 0 aes pare A ek os a ae has Sod 
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efficient set analyses become familiar, when the use of computing mi 
to help select portfolios is more commonplace, when investment mai 
are ready to consider additional sophistications, this is the time to 
directly the single time period utility analysis to the selection of 
portfolios. 


Man. In Chapter VI we saw that the expected value of Ln(1 + R) could 
bie well represented by 

(Vv 

Ln(t + £)] - -——... 
{Ln(t + £)] i+ 
Where ¥ = var(R) and E = expt(R). If portfolio | has the same E anda 


EFFICIENT SET ANALYSES Mnaller V than portfolio 2, it would have a greater value of 


a2 


Let us imagine three Rational Men, each about to select a 
(WEF 


Each is attempting to maximize the expected value of a utility 
U(R). For the first Rational Man U(R) equals the natural logari 
(1 +R). Thus U, = Ln(1 + A), where U, represents the utility of 
to the first man, For the second Rational Man U(R) equals the 
root of (1+ 8); U,= VI + R=(1 + 2)" For the third Ri 
Man U(R) equals the cube root of (I +R); Us = (I +R) 


[Ln(l + £)} — 


Thus the portfolio which maximizes [Ln(1 + £)) — ((1/2)V/(l + EV) 
Must minimize V for some value of E. In the present example, the port- 
folio which maximizes [Ln(i + £)] — {(1/2)V/(1 + £)'] is represented by 
he of the points on the curve of Figure 4 in Chapter II. Since it is 
utility function of each of these rational men reflects his personal ‘A feasonably good approximation to Ln(| + R), we may expect that the 
ness and ability to bear risk. Each of these utility functions is a which maximizes (Ln(t + £)] — ((1/2)¥/(1 + £)*], represented 
conservative, concave type. iy & point on the curve, has an expt{Ln(1 + )] almost as great as the 
Suppose, for illustrative purposes, that Masimum obtainable. 

‘The restriction of choice, in short, apparently imposes little hardship on 
Or first Rational Man. 

Are our second and third Rational Men as fortunate? The function 
{\ + 2)? may be approximated by the quadratic 


(1) the portfolio is to be selected from the ten securities, includit 
used in Chapter II; and 

(2) the three men agree that the past returns presented in 
exactly reflect the probabilities of future returns. 


In short, we shall use the analysis of Chapter If to illustrate NCR KR 
concerning our three Rational Men. 

Now let us suppose that (for some reason) the Rational Men mi 
cone of the portfolios represented by a point on the curve in Fi 
Chapter II. The portfolios which the three men choose 
restricted are not necessarily the same as they would otherwise 
The portfolio which maximizes the expected value of Ln(1 + A) 
necessarily minimize variance for some value of mean, nor necel 
the portfolios which maximize the expected values of (1 + 
(1 +R)! The best portfolio on the curve may, in each case, 
expected utility than some portfolio represented by a point 
curve. 

Is the loss to the Rational Men great or small? Are they 
well off when restricted to portfolios represented by points on the 
they would be if not thus restricted? Or does the restriction 
serious hardship? 

To start, let us consider these questions in the case of the first 


function (1 + R)/ may be approximated by the quadratic 


1+ (1/3)R — (1/9)R?. 


curacy of these approximations is illustrated in Table 1. Its 
dratic approximation deviates from (1 + R)¥? by .00 or .01, to two 
imal places, for R from about a 50% loss to about a 707% gain. Ata 
of 60% the approximation is only .03 greater than (1 +R)“. Ata 
Wi) of 100% the approximation is only .03 less than (1 + R)”*. The 
ailratic approximation to (1 + RJ! is about as good. It deviates by 
Or .01 to two decimal places, from about a 40% loss to about a 70% 
). Ata 60% loss the approximation is .02 greater than (1 + R)!*. 
44 100% gain the approximation is .04 less than (1 +R), 

Hine these quadratic approximations to (1 + R)Y? and (1 + R)*# are 
lantially more satisfactory than the approximation to Ln(I + R), 
Wi expected values must be fairly good approximations to the expected 
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{haximizes the approximation gives a utility little less than the maximum 
btainable utility, 

The restriction to portfolios represented by points on the curve, there- 
fore, imposes little hardship on either the second or third Rational Man. 
Pach can find a portfolio which is almost as good as the best he could have 
Otherwise obtained, 

While it cannot be claimed that all such utility functions can be accurately 
spproximated by a quadratic, the quadratic nevertheless shows a surprising 
flexibility in approximating smooth, concave curves. Whenever the 
individual's utility function can be reasonably approximated bya quadratic, 
ine of the portfolios which minimizes variance for some value of expected 
Islurn provides almost the maximum obtainable expected utility. 

Each of our Rational Men has his own utility function. Each examines 

ihe curve showing efficient combinations of mean and variance. Each 
Wlects the portfolio most appropriate to his own needs. Each obtains a 
Portfolio almost as good as the very best available for him. 
This provides a viewpoint with respect to the actual use of efficient set 
Mulyses. If the person carefully selects among the efficient combinations 
W mean and variance, and if his true utility function is approximately 
iwidratic, then his final choice of portfolio is about as suitable as a port- 
lio can be. It is not essential for the investor to understand that he is 
Wilating a Rational Man maximizing a quadratic approximation to his 
WWeutility function. Tt is only necessary for him to understand what mean 
#\) standard deviation represent, and to consider his choice carefully. 

Our three Rational Men choose portfolios on the basis of utility fune- 
fioos U(R). Their utility functions U(C,, yj.1) must satisfy one of the 
Hnditions which permit a choice of portfolio without, at the same time, 
Wiviring a choice of C. Let us consider a fourth Rational Man whose 
WNC), v:.1) does not meet such a condition, For one value of C perhaps 
WN felationship between U and R is equivalent to Ln(1 + R). For 
‘Mother value of C perhaps the relationship between U and’ R is equivalent | 
Wl + 2). For a third value of C we may imagine that the relation- 
Wip between Uand Ris equivalent to (t + R)'/. Each of these relation- 
Whips can be reasonably approximated by a quadratic. Let us assume 
Wal, in general, for any choice of C the relationship between U and R, 
}y our fourth Rational Man, may be approximated by a quadratic. In 

fase our Rational Man can examine the curve in Figure 4, Chapter Il, 
determine 
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Taste | 
APPROXIMATIONS TO (1 + R)!? AND (1 + R)V? 


(+ Ry? (1+ R)"3 | Approx, 


values of (1 + A) and (1 + Ry! The expected values of 1 + 
— (1/8)R® and 1 + (1/3)R — (1/9)R® are, respectively, 


1+ (1P)E — (1/8)(V + E%) 


and 


1+ (1/3)E — (1/9 + E%). 


These are approximations centered about R = 0, correspon 
E = (1/2(V + £%) for La(1 + R). The approximations centered. 
R= E for (1 + R)! and (1 + R)*/? are, respectively, 


(1+ Ey! — (1/8)Ml + Ey and (1 + £E)Y9 — (1/9) ML + 


The latter approximations are presumably slightly better than the 
The portfolios which maximize any of the above approximations 
mize variance for some mean. In the present example, all are re} 
by points on the curve in Figure 4, Chapter II. Since each is a 
approximation to the utility function in question, the portfolio 


(8) the (approximately) best portfolio associated with each value of C: 
{h) the desirability of each such portfolio; and, hence, 
{f) the best combination of both C and portfolio. 
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Proor: That U = aR + 6  f(R) implies (i) and (i) is trivial, We shall prove 
the converse, that (i) and (ii) imply U= aR + 6-f(R). If (E, Fy) and (Ep, F,) 
‘ire the E. F combinations associated with two portfolios, the £, F combination 
Msociated with 


Thus the use of an efficient set analysis based on mean and variance 
not necessarily assume that the choice of portfolio is independent 
choice of C. 


4 probability p of portfolio 1 and 


UTILITY FUNCTIONS AND MEASURES OF RISK A probability (I — p) of portfolio 2 


Suppose a Rational Man chooses a portfolio solely on the basis i 
mean and variance. If two portfolios have the same mean and va) 
they are considered equally good. Since the value of variance 
derived from the values of mean and expt(R*), this Rational Man From this it follows that Axi ; 
be said to act on the basis of mean and exp\(A’). If two portfoli 1f, F). For example, if (e, ED & piety EF. pasties 
the same expt(R) and expt(R*), they have the same var(R) ai Aaiom II i 

assumption, must be considered equally good. a probability p of (E,, F,) and (| — p) of (E,, Fy) 

Ifa Rational Man maximizes the expected value of a quadratie Wh preferable to 


ae U=c + aR + hR®, then he acts on the basis of ex a probability p of (Ey, F;) and (1 —p) of (£3, F3). 
expt(R?), since i 
expt(U) = ¢ + aexpt(R) + bexpt(R’). Bi 51.5) Zia Fs) hen 
PEs, Fy) + (I — pXEs, Fa) > PE Fy) + (1 — pXEy Fs) 


Tt follows from the main result of this section that the converse 18 al 
If a Rational Man 


E* = pE, + (1 — plEy, 
FY = pF, + (1 — p)F. 


>0. 

Thus Axiom If applies to £, F combinations as well as to the underlying 
bability distributions. Axioms I and IIL can similarly be shown to apply 
‘he E, F combinations. These axioms, applied to E, F combinations, imply 
| these combinations can be ordered by a linear ordering function 


EU =aE + OF. 
EU is the expected value of 
U = aR—b-/(R). QED. 


{i) maaimizes the expected value of some utility function, and 
(ii) acts solely on the basis of expt(R) and expt(R*), then fi 
function is quadratic 


More generally, the theorem in this section is concerned 
measure of risk F which equals the expected value of some func 
The theorem asserts that 


(i) if (R)is a rule which associates « number / to each value 
(ii) if F is the expected value of /(R), 
(iii) if the individual does maximize expected utility, but 4 . x 
(iv) the individual acts solely on the basis of expected return The present and following section discuss six measures of risk: 

measure of risk F, then Tl) the standard deviation; 
(2) the semi-variance; 

1)) the expected value of loss; 
14) the expected absolute deviation; 
18) the probability of loss; and 

16) the maximum loss. 


JATED UTILITY FUNCTIONS 


the individual's utility function is of the form 
c+ aR + b- fia). 


Without loss of generality it may be assumed that the zero of 
scale is chosen so that ¢ = 0 

This relationship between utility functions and measures of Fi 
valuable in the subsequent discussion of alternative measures of 


fonsider the desirability of selecting portfolios on the basis of 
{ed return and one or another of these measures of risk. The above 
}) hot an exhaustive enumeration of all possibly worth-while measures 
Wh The measures listed are used to illustrate a method of evaluation 


th? to other measures of risk, and to measures of central tendency 


Tueonem: Let £ = expt(R), F = exptt/(e)]. An individual 
expected value of 2 utility function aR + 6- f(R), if and only if 
(i) he maximizes the expected value of some utility function, and — 
(ii) his preferences are based solely on E and F. 
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In the present section we consider the utility functions associated 
each of the six measures. In all but one case the associated utility fu 
of each measure can be inferred from the theorem of the last section, 
the one remaining case it is shown that no utility function can be con} 
with choice based on the particular measure, 

In the following section we consider the desirability of the dil 
types of associated utility functions, and hence the desirability of t 
measures of risk 

The Standard Deviation. 


We have seen that, if an investor 


(a) maximizes the expected value of some utility function, and 
(b) his choice among portfolios depends only on their expected 
and standard deviations, 


then his utility function must be a quadratic: U = c + aR + bY 
investor prefers smaller standard deviation to larger standard 
(expected return remaining the same), then b < 0. 

The quadratic can, in some cases, provide a satisfactory approxi 
to a concave utility function over a reasonable range of returns 
investor whose utility function equals either Ln(I + R), (I +R} 
(1 + 8)'2, and whose probability beliefs are similar to (cover al 
same range as) the past experience portrayed in the ten-security exal 
Chapter II, could find a quadratic approximation sufficiently close | 
true utility function that the portfolio which maximizes the former 
necessarily be excellent in terms of the latter. 

Figures 1a and Ib present two examples of quadratics. The ex 
Figure 1a is the approximation to the logarithm U = R — (1/2)R¥ 
in Figure 1b is the quadratic U = R — 2.3K. The latter qui 
associated with the choice (from among the portfolios represented | 
line in Figure 4, Chapter Il) of the portfolio with 


expected return = .1, and 
standard deviation = .11. 


The efficient portfolio with this combination of expected return @ 
dard deviation, in other words, is also the portfolio that gives 
expected value of the utility function presented in Figure Ib. 
The quadratic in Figure 1b reaches a maximum at R = 22. 
greater than .22, utility decreases as R increases. At least for 
this quadratic is an extremely bad approximation to an inves 
function, 
Every quadratic c + aR + bR®, with 6 <0, reaches such a 
In some cases this maximum occurs above the range of “relevant 
In other cases the maximum occurs within the range of “relevant 
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‘The upper limit of the range of “relevant returns” may be defined as the 
preater of 


(a) the largest return experienced by the portfolio which maximizes the 
Approximate utility function, and 


(b) the largest return experienced by the portfolio which maximizes the 
true utility function, 


As long as the true and the approximate utility functions are similar up to 
(a) or (b), whichever is larger, the portfolio that maximizes the approximate 
Wility function will also nearly maximize true utility function. 


@ 


Figure 1. (a) U= R —(1/2)R*, (b) U = R = 2.38%, 


the utility function in Figure 1b, maximized by a portfolio with expected 


Mlurn 1 and standard deviation .11, has its maximum within the range of 
Wlevant returns. The values of expected return, and one- and two-unit 
‘indard deviations, are indicated along the horizontal axis of Figure 1b. 


The maximum of the quadratic occurs, not well beyond E + 2s, but at 
Re E+ (1.10. 


The seriousness of such a maximum within the range of relevant returns 


W discussed in the next section, 
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The Semi-variance. The semi-variance S, is the expected value 
[min(R — 6, 0)}*. Figure 2a shows the curve 


U = — [min(R — 22,0). 


Except for choice of zero and unit, for R < .22 the curve in Figure 
exactly the same as that in Ib, Thus, for choices only involving ret 


u 


et 
40-3 -2 =1 


co) 


Figure 2. (a) U = — (min(R — 22, 0)]7; (b) U = (.1)R — (min(R — 1, 
© U = (DR — [min(R — 3, 0? — 03. 


below .22, the curve in Figure 2a gives exactly the same results 
1b. For return greater than .22, however, utility remains © 
Figure 2a rather than falling as in Figure 1b. The curve in Fi 
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Clearly a more desirable type of approximation 
function than is that in Ib. 


For any quadratic (c + aR + bR', 6 <0) it 
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to an investor's utility 


is possible to define a 


number d so that the semi-quadratic curve [min(R — d), 0}* is (except for 


choice of zero and unit) the same as the quadratic 
Until the latter reaches its maximum, and then 
it remains constant rather than falls. If the 
Maximum of the quadratic occurs within the 
fange of relevant returns, the semi-quadratic is 
tore desirable than the corresponding quadratic, 
If the maximum of the quadratic occurs above 
the relevant range, both the quadratic and the 
jemi-quadratic are equally good. 

Thus, if choice were restricted to portfolios 
Which minimized S,, no attention being paid 
lo expected return, the best portfolio available 
Under such a restriction would be at least as good 
§s the best portfolio produced by an analysis 
sed on expected return and standard deviation. 
If an analysis considers expected return as 
Well as Sq, minimizing the latter for given values 
Of the former, the associated utility function is 
f the form 


Um c+ aR + blmin(R —d,0)}?, for b <0. 


As illustrated in Figures 26 and 2c, this func- 
Vion is the conservative quadratic for returns less 
than d; it is the more devil-may-care straight 
Whe for returns greater than d. 

Expected Loss. Loss may be formally de- 
fined as —min(R, 0). Thus, when R>0, 
Wis = —min(R, 0) = 0; when R<0, loss = 
 min(R, 0) = —R. Expected loss, then, equals 


expt{—min(R, 0)]. 


Hf an investor 


U 
@ 
u 
R 
) 
u 
R 
© 


Figure 3. (a) min(R, 0); 
(b) R + min(R, 
(©) R+2min(R,0). 


{) maximizes the expected value of some utility function, and 


{b) chooses among utility functions solely on the basis of expected 


Murn and expected loss, 
then his utility function must be of the form 


U=c+aR + bmin(R, 0). 
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Figures 3a, 3b, and 3c show, respectively, the functions mint 
R + min(R, 0), and R + 2min(R, 0). 

The function c + aR + min(R, 0) consists of two line segments 
meet at R = 0. To commit oneself to choose a portfolio which mini 
expected loss for some value of expected return is the same as to Co} 
oneself to choose a portfolio which maximizes a utility function simil 
those in Figures 3a, 3b, and 3c. The desirability of thus com: 
oneself is discussed in the following section. 


investor acted solely on the basis of expected returns and expected absolute 
deviations of portfolios, then his utility function would have the form 


c+ aR +O). 


=|R|. R= (1/2)|R], R —|R), and R — 2|R| are shown in Figures 4a, 4b, 
4c, and 4d, respectively. If |b] a= 1. ¢ + aR + d|R| reaches » maximum 
R= 0. If [| a < 1, the utility function associated with expected 
absolute deviation is of the same 

form as that associated with expected u 
loss. iB 
If action were based on the expected 
Value of the absolute deviation about 
jome number d, expt|R — d|, then the 
Wsociated utility function would be of 
the forme + aR +b|R —d|. This would 


be represented by two straight line seg- a 
Ments which meet at R = d. ib 
Probability of Loss. The probability 


@ 


(a) a that R will equal zero or less may be R 


‘Mpressed as the expected value of a 
function g(R), where oa aint 
0 ifR>0, 
R) = 
&(R) {i ifR<0. 5 


Figures Sa, 5b, and Sc, respectively, 
went g(R), R—(1/2e(R), and R— 
(R). Like the two measures last dis- 
‘Wissed, the utility function associated 
with “probability of loss” consists of two 
‘Maight line segments, Unlike those of 
the (vo previous measures, the straight Figure 5. 
Hines are parallel and are separated by a (b) R — (I/2)g(A 
Mcontinuity at R = 0. 

Maximum Loss. fan investor chooses among portfolios solely on the 
Wwsis of expected return and maximum loss, insisting on a minimum loss 
for any given value of return, then the investor's actions could not be 
Wied on a single stage utility function U(R). This may be shown by 
‘Hhibiting a contradiction between maximum Joss and the axioms of 
pected utility presented in Chapter X. Suppose that two probability 
Witributions of return (e.g., those associated with two portfolios) have the 
Sine expected returns but different “maximum losses.” We may assume 
{Mt maximum loss M,, associated with probability distribution 1, is 


«© (a) 


a(R); 
3 (©) R~ 29(R), 


Figure 4, (a) — [A]; (b) R-(12)|A]; () R-|R|; 1d) R= IA 


Rather than expected loss, one could use expected return 
expt[—min(R — 6, 0)]. The utility function associated with, 
based on this measure is, again, a pair of connected stral 
segments. These line segments, however, meet at R = 6 rather 
R=0. 

Expected Absolute Deviation. |R| equals R when R is gi 
zero; equals minus R when Risless than zero. The expected val 
absolute deviation from zero is defined as expt|R]. Ifa utility 
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greater (greatest loss is more negative) than that (M,) associated 
probability distribution 2. By hypothesis, probability distribution 2 
preferred to probability distribution 1. The probability distribu 
associated with a 50-50 chance of distribution 1 or 2 has the same expect 
return as distributions 1 and 2 and has the same maximum possible loss 
distribution |. Since the 50-50 chance of distribution | or 2 has the 

expected return and maximum loss as does distribution 1, and since tf 
are the only properties of the distributions the investor considers, 
investor must be indifferent between distribution | and a 50-50 chan 
lor2. But, since distribution 2 is preferred to 1, this contradicts Axi 
of Chapter X. 


THE EVALUATION OF MEASURES OF RISK 


The assumption that the investor acts on the basis of some single 
utility function U(R) rules out the possibility that he acts on the 
expected return and maximum loss. A combination of expected 
and any of the other five measures of risk, on the other hand, woul 
consistent with the existence of a single stage utility function U(R). 
has an associated utility function. A portfolio which gives an ef 
combination of expected return and probability of loss, for exam) 
also the portfolio which maximizes a utility function similar in { 
those illustrated in Figures Sa to Sc. To restrict oneself to por 
which are efficient in terms of expected return and probability of 
also to restrict oneself to portfolios which maximize one or another of 
utility functions. If we judge the desirability of the latter restriclia) 
simultaneously judge the desirability of the former restriction. 

There are a number of ways in which the associated utility funet 
Figures | through 5 differ from the concave utility function of Fi 
Chapter X. Several of the curves have one or two long linear seg 
one has a discontinuity; one always reaches a maximum, By eval 
the seriousness of these “peculiarities,” we can evaluate the utility 
tions, and hence the associated measure of risk. 

Linear Segments. The utility functions associated with “expected 
“expected absolute deviation,” and “probability of loss” are made 
two straight line segments. The utility function associated with 
variance S, is linear for return greater than 5. 

Let us consider changes in probabilities within a region where & 
function is linear. For example, let us consider the effect of ef 
probabilities on the expected value of the utility function in Figuie 
the region R <0, while probabilities of returns in the region 
remain unchanged. 
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For such changes in probabilities, expected utilily (nufeuses Wf aiid onl 
WW expected return increases. If the utility function Ware #8 1h Pique 
the investor would be indifferent between 


(i) a .S probability of a 10°% loss, and 
(it) @..1 probability of a 50% loss and a 4 probability of no Change 


A linear segment over the region &<0 is quite objectionalie, A 
Consumer with a linear utility function for R<0 would never buy ine 
surance. If the insurance company had no expected gain al all, the ea: 
Sumer would be indifferent between insuring and not insuring, If \he 
(surance company had the slightest expected gain, the individual would 
prefer to take the chance of a large loss rather than pay the premium A 
Portfolio analysis based on such a utility function might similarly change a 
Substantial loss in fortune rather than select a much more conservative 
slternative with slightly less expected return, 

Every point on the curve relating minimum expected loss to expected 
Jelurn corresponds to a utility curve as in Figure 3. No matter how 
“conservative” the point chosen, the associated utility function is still 
linear in the region R<O. It still may have a foolhardy lack of con- 
‘ervatism. The portfolio chosen on the basis of an analysis implicitly 
suming a linear utility function for R <0 may be worse than that which 
Would be chosen by unaided common sense. Common sense might avoid 
Major pitfalls to which such an analysis is insensitive. 

Linear segments for returns greater than 0, or greater than some larger 

Humber 6, are not as objectionable. The utility function in Figure 2b, for 
*Mample, has a conservative attitude toward losses, but a more devil-may- 
fie attitude about gains greater than 30%. As long as it does not affect 
the probabilities of gains of 30°% or less, the utility function in Figure 2b 
Welers changes which increase expected return. This does not seem 
Wreasonable, even if not precisely an optimum, 
On the basis of the linear segments of their associated utility functions 
fv R <0, we must reject expected loss, expected absolute deviation, and 
Probability of loss. We might attempt to avoid the difficulty in the case 
‘expected loss by using expected return below 6. To affect the previous 
Mpument, 6 would have to be negative. If 5 were quite negative, an 
WWidesirably large range of returns above 6 (including a wide range of 
‘ises) would be subject only to the expected return rule. If 6 were not 
Wsige, the original argument would apply with little abatement. Similar 
Mjuments apply against the use of expected absolute deviation from b or 
Probability of return below 5. 

The argument against utility functions consisting of two line segments 
floes not imply chat all piecewise linear functions should be rejected. The 
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utility function in Figure 6, composed of a number of small line segme! 
gives essentially the same results as a smooth curve drawn through 
segments. 

Discontinuities. A curve with a discontinuity can never be a co 
curve. This implies that there is some risk situation which the int 
prefers to having the same expected return with certainty. With the ¢ 
in Figures 5b and Sc, for example, a 50-50 chance of a 60% gain oF 
would be preferred to zero return with certainty. Such preferences 
lead us to reject this type of cf 
even if it had not been previ 
rejected on other grounds, 

Small discontinuities in an 
wise reasonably shaped curve, 
ever, can be as excusable as Summary. There is always a portfolio efficient in terms of expected re- 
small linear segments of the cu turn and S, (for suitable b) which is at least as good as the best portfolio 
Figure 6. based on analysis using expected return and variance. It is not unlikely, 

Non-monotonicity. The utility however, that the portfolio produced by an analysis based on expected re- 
tion in Figure 1b reaches a ma (urn and variance is better than one formed by unaided judgment? The 
within the relevant range of Associated utility function can provide insight as to the suitability of the 
The curve is clearly a bad a Portfolio based on such an analysis. Portfolios selected on the basis of 
mation to a reasonable utility ¢xpected loss, expected absolute deviation, or probability of loss are not to 
Figure 6. A piecewise linear function. tion for returns greater thal be trusted. They can be foolishly speculative even when apparently con- 

‘trvative. The assumption thata utility function exists rules out maximum 
loss as a measure of risk. 

An analysis of desirability based on associated utility functions can be 


Applied to other measures of risk, and to measures of central tendency as 
well 


(b) it has a standard deviation of return equal to 11 % 
(c) it has highest expected value of the utility function in Figure 1b. 


Since the utility function in Figure 1b drops quite rapidly as we move in 
the direction of increasing losses, since expected return minus two stan- 
dard deviations is only a 12% loss, we can be assured that large losses are 
carefully avoided. Nevertheless, expected return is 10%. It seems 
unlikely that unaided judgment would have chosen an equally conservative 
eombination of the ten securities with a higher long-run return. 
Conceivably a similar argument applied to a portfolio from another 
E, V analysis might not produce an equally encouraging result. At least 
this approach provides some check on the suitability of the results of such 
an analysis. 


maximum 

The objection to such a quadratic is different in nature from the 
to curves which are linear for R< 0. In the latter case the utility 
tion may be excessively speculative in the face of losses. The qui 
‘on the other hand, while conservative in the face of loss, may fi 
ignore opportunities for greater gain: 

In the case of the measures whose utility functions were Ii 
R< 0 we argued that a portfolio selected on the basis of such an 
could be undesirable—substantially worse than that based on 
thumb judgment. This is not so for the portfolio corresponding 
ten-security analysis) to the quadratic in Figure 1b. The inf 
presented by the curve in Figure 4, Chapter II, is sufficient to indica 
the portfolio is “fair to good” and is probably better than could be 
‘on the basis of unaided judgment. Consider what we know al 
portfolio selected and the investor who selected it: We know 1} 
investor is quite conservative—the utility function in Figure 1b 
quadratic which comes closest (for present purposes) to represe! 
true utility function. Concerning the portfolio we know that 


GENERALIZATIONS, 


In this chapter we have thus far assumed that 


(1) the investor owns only liquid assets; 

(2) he maximizes the expected value of a multi-time period utility 
Mnction U(C,, Ca,* + +, Cp), where C, is the money value of consumption 
{perhaps adjusted for cost of living); and 


(3) the set of available probability distributions of returns from port- 
folios remains the same through time. 


' Throughout this discussion we assume, of course, that the means, variances, and 
‘Myariances are probability beliefs rather than simply past performance, 

‘This is based on the assumptions that we have the investor's, or investment man- 
H's, probability beliels, and that the conditions implicit in the analysis (e.g., those 
‘Hwmerated in the first section of the chapter) are substantially correct 


(a) it has an expected return equal to 10%; 
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We shall now discuss cases in which one or more of the above assumpli 
does not hold, Specifically we shall consider instances involving 
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(a) consumer durables; 

(b) non-portfolio sources of income; 
(c) changing probability distributions; 
(d) illiquidities; and 

(ce) taxes, 


In each case we shall discuss how the additional consideration chan, 
single period utility function, and how the efficient set analysis cal 
accordingly modified. 

In the instances considered, our approach leads to one of three 
conelusions: 


i. No modification is needed. The additional consideration ij 
which the investor should take into consideration in selecting a po! 
from among those offered by an ordinary efficient set analysis, 

ii, Modifications are required and are feasible. Portfolios 
efficient while ignoring the consideration may be inefficient w 
consideration is introduced. The consideration may be introdu 
the efficient set analysis without excessive cost. 

iii, It is impossible or patently uneconomical to introduce the 
fication in a theoretically correct manner. Perhaps formal anal) 
be valuable, however, if modified to account roughly for the consid 
in question. 


If we tried to apply our efficient set technique to the general pi 
business uncertainties through time, we would probably come to # 
conclusion: 


iv. The particular mode of analysis is too removed from the 
the subject to be bolstered by intuitive modifications. Another aj 
(such as Monte Carlo analysis [18]) merits consideration, 


Consumer Durables. Now let us admit the possibility that the 
owns “consumer durables” while continuing to assume thal @\ 
probability distributions are constant through time. Consum 
bundle of goods and services “enjoyed.” Consumption at any 
subject to constraints set by 


(a) the amount of money devoted to current consumption; 
(b) the amounts of various kinds of consumer durables avail 


We continue to assume that the portfolio is the only source of 


APPLICATIONS TO PORTFOLIO SELICTION. 


The investor faces a complicated set of decisions. He mth We 
funds between portfolio and non-portfolio expenditures, witli 
folio he must allocate funds among a large number of securities) ihn 


non-portfolio expenditures he must allocate funds between various (ois 
of immediate consumption expenditures and durable purehaies He dies 


this to maximize the expected value of a single stage utility function wiieh 
depends on 


(a) current consumption, 
(b) amounts of durables at the end of the period, and 
(c) the value of the portfolio at the end of the period. 


In the terminology of the last section, (a) is current consumption ©; 
(b) and (c) identify the next wheel w41. 

If, for any given (a) and (b), utility is a quadratic function of the value 
Of the portfolio, then all the investor need know concerning available 
portfolios is summarized in an E, V efficient analysis. Given one set of 
Non-portfolio decisions, one E, V efficient portfolio maximizes expected 
wility. For another set of non-portfolio decisions, perhaps another £, V 
efficient portfolio maximizes utility. Given the whole set of £, V efficient 
fombinations, the rational investor can determine which portfolio is 
optimum while considering both his portfolio and non-portfolio 
Opportunities. 

Exogenous Assets. Suppose that the portfolio is not the investor's only 
Source of income, but that he has additional income, e.g, in salary form, 
We shall assume that decisions with respect to employment will not be 
immediately influenced by available portfolio opportunities. The indi- 
Yiual has decided to retain his job whatever the outcome of the portfolio 
Malysis. 

The probability distribution of returns from other sources is thus assumed 
fiven, but it is not irrelevant to the selection of a portfolio. Stability of 
felurn as a whole depends, not only on the variance of portfolio return and 
‘arlance of non-portfolio return, but also on the correlation between these 
two. This may be considered in an efficient set analysis by treating non- 
portfolio returns as an “exogenous asset”—an asset whose amount cannot 
he varied in the portfolio analysis, but whose return is random like that of 
‘Ihe other assets. 

Changing Distributions. Let us consider a particular case of probability 
Hhiributions changing through time. Specifically, suppose that available 
Nwvibutions depend on some number 6 which measures the “bullish- 
Ws" or “bearishness" of the market. For given decisions concerning 
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non-portfolio expenditures, the single stage utility function to be 
mized depends on the value of the portfolio ¥ and the level of b: 


U= UY, b). 
If Uis approximately quadratic, that is, 
U=Vtab+PY? t ybY + 6% 


is likely to differ from the actions of the rational man, The procedure 
consists of: 


1. Choosing a “period of analysis” (¢.g., 3 months, @ year, 3 years); 
the means, variances, and covariances are defined for this period. The 
mean and variance of the portfolio are those which would apply if we 
bought the portfolio now and held it until the end of the period of analysis. 

2. Moving slowly, on the basis of judgment, in the direction of the 
desired portfolio. The desired portfolio itself may be modified, on the 
basis of judgment, to account for considerations not incorporated into the 
formal analysis. 

3, Periodically recomputing the formal analysis. Even if the “period 
of analysis” were 3 years, the “frequency of analysis” might be 3 months to 
indicate changes in the desired portfolio based on accumulated informa- 
lion. Stand-by analyses might also be performed in advance, to indicate 
how the desired portfolio is to be modified if some major event (war, 
precipitous drop in market, threatened major strike) should occur. 


then expected utility is 
EU = E + a£b + BEY? + yEbY + OEY*. 


The choice of portfolio cannot affect £b and Eb? The select 
portfolio depends on the mean and variance of Y and the corel 
between Y and 5. Efficient combinations of E, EY*, and EbY 
generated by treating b as if it were an exogenous asset. — Several o1 
efficient set analyses, with greater or lesser weights on in the value 
portfolio, are required to survey possible combinations of expected 
variance of return, and correlation with 6. 10 

Mliquidities. Securities are not, in fact, perfectly liquid. Fi 


i Such a procedure may be expected to produce near optimum results if 
moment we shall consider two sources of illiquidity: 


(a) among different securities, long-run means, variances, and co- 
Yiriances are about proportional to short-run means, variances, and 
fovariances, and 

(b) the portfolios which seem most desirable today are most likely to 
‘eem desirable “tomorrow.” 


(i) costs of transactions, and 
(ii) thinness of the market. 


A third source of illiquidity, the capital gains tax, is discussed la 
If securities were in fact perfectly liquid, a Rational Investor 
move instantly from his present portfolio to his desired portfol 
would change portfolio with each change in belief or need. If 
were to follow such a procedure, the above-mentioned illiquidit 
cause him to fare quite poorly. Constant buying and selling 
accumulate transaction costs; attempts to make rapid radical ol 
the portfolio, to buy or sell large amounts of a security, would: 
affect the price for the iarge investor, The Rational Investor mul 
more slowly toward his desired portfolio because of the existence 
illiquidities. f 
We cannot solve the problem of the optimum strategy over 
illiquidities. Even if it were feasible, it would be uneconomic {0 
moment-by-moment, or even day-by-day, the optimum seq) 
timing of purchases and sales for a large portfolio. If formal a 
to be used at all, it must incorporate some explicit or implicit 
on the proper procedure in the presence of both illiquidities and 
capability of performing analyses. , 
ies ie ane pass is be a “natural” procedure in 
portfolio analyses, and then consider how the outcome of such # 


I (a) and (b) do not hold, we should consider whether some intuitive 
Modification of procedure would not produce more optimal results. 
Wppose, for example, that'a particular security does not seem likely to be 
orth holding for the entire “period of analysis” (say 2 years) but seems to 
Wve some desirable short-term possibilities (for 6 to 12 months), This 
fonsideration could be incorporated in one of two ways: One would be to 
Bere the analysis without taking the exceptional short-run possibilities 
WO account, then intuitively increasing the amount allocated to the 
Particular security. A second way would be no! to let the security's 
‘ein, variance, and covariances reflect probabilities of outcomes over the 
period of analysis”; but, rather, let them reflect the probabilities of gain 
Wy loss if: (i) the security is purchased now and held until itis propitious 10 
Wil (either because the time is right to take a gain or cut short aloss); then 
1H) (he proceeds from this sale are reinvested in the portfolio until the end 
the “period of analysis.”* 
Other failures to meet condition (a) or (b) above can be similarly treated, 
Hy adjusting either the inputs or the outputs of the analysi 
‘Two questions which must be settled before undertaking an analysis are 


302 PORTFOLIO SELECTION 


“What should be the frequency of analysis?” and “What should be 
period of the analysis?” No precise answers can be given to U 
questions, but factors meriting consideration can be pointed out. 
frequency of analysis should depend on (among other things) the cost 
analysis and the rate at which new information accumulates. It 
information accumulates slowly, if few serious changes in probal 
beliefs occur from month to month, then a quarterly analysis may be 
that is needed (even if the cost of the analysis were quite reasonable), 
were felt that a quite complicated, fairly expensive analysis is absol 
essential for the particular investment situation, then perhaps a 
analysis would be appropriate. 

Between dates of successive analyses, judgment would be requi 
revise the “desired portfolio” to account for new information a 
determine the rate at which the desired portfolio is to be app: 

Taxes. Taxes would pose no special problem of analysis if it were 
for the provision that capital gains are taxed only when “realized” 
only when the security is sold). Without this consideration we 
simply define means, variances, and covariances in terms of return 
taxes, and perform an ordinary efficient set analysis with our inputs 
defined. Even if tax rates changed with changes in the level of ine 
E, Voran, S, analysis could be performed using procedures sim! 
those required for deriving an E, 5, efficient set. 

Problems of lliquidity arise, however. when capital gains are taxed 
realized. The utility of a portfolio to an investor depends not s0 
its market value, but also on whether taxes would be incurred if 
particular securities in order to move from his present portfolio to 
desirable one. Even if there were no other illiquidities, taxes on 
capital gains would retard movements from present portfolios to 

portfolios. 

We cannot calculate exactly the manner by which the portfolio 
be modified to account for capital gains provisions. We shall, 
offer a possible method of handling this consideration and di 
judgment might improve the results. Perhaps future conceptual 
plus practical experience can produce improved procedures in this 

Proposal: Define the “worth” of a security or portfolio at (he 
the year to be 

(the market value of the portfolio) minus 
(taxes payable on realized gains and income) minus 


(b) times (unrealized capital gains) plus 


(c) times (unrealized capital losses), 
where b is a constant reflecting the “likely” tax incurred next 
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dollar of unrealized capital gain at the end of this year, and c is a constant 
feflecting the “likely” tax savings from realizing capital losses next year per 
dollar of unrealized capital losses at the end of this year. and c must be 
between zero and the capital gains tax. For fixed 5 and c, estimated 
perhaps on the basis of past realized capital gains and losses, an ordinary 
Analysis could be performed in terms of the means, variances, and 
covariances of worth thus defined. 

The means, variances, and covariances of return that serve as input to 
the analysis, or the portfolio which is the output of the analysis, may be 
modified by judgment to reflect, for example, that capital gains on some 
particular security are more likely to be realized than they are on the 
average security. 


‘The Crotce oF ANALYSIS 


In designing a portfolio analysis, two objectives should be kept in mind. 
First, an attempt should be made to keep the analysis simple; second, an 
‘tempt should be made to understand the salient implications of major 
simplifications, 
Simplicity of analysis means less work in developing inputs, less com- 
puting time in obtaining the outputs. The simpler analysis, done in less 
lime for less cost, can be revised more frequently. 
Simplicity, however, can be bought at too high a price. We could 
freatly simplify the analysis, for example, by considering the maximization 
of expected return alone. But our analysis, showing a one-securit 
portfolio as optimal, would be of little aid to judgment. itdanert 
{among its other applications) must be used to decide what refinements are 
a incorporating into the analysis. 

ie Tigorous mathematical analysis, the electronic com 
lens of thousands of calculations er second, the Riera cd 
Manager who can grasp intuitively a wider range of considerations than 
fen ever be incorporated into a computer—each has a role to play in the 
jupervision of large portfolios of securities. 
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PORTFOLIO SELECTION uw 
The expected return E of a portfolio is 


E=yp'r, 
The variance of return V of a portfolio is 
APPENDIX A Vs X'CX. 
An E, V combination (£9, V*) is obtainable if there is « legitimate 
THE COMPUTATION OF EFFICIENT SETS portfolio X° with 
Pape 
and 
Voi XE ONE! 
Tue £, V Erricient Set PROBLEM An E, V combination (E°, V°) is efficient if 
; ; (1) E°, V®is obtainable, and 
f Appendix A is to show that the computing proc é ately ; 
etre mai aie the desired efficient sets, It is assi (2) no obtainable combination £', V? exists, such that 
te the reader of this appendix is familiar with matrix algebra either E1> £° and VI< V° orelse E1>£ and VI < V9, 


Lagrangian multipliers. It is advisable to read Chapters VII and VII PP ionincrns 
way of introduction and illustration, before reading this appendix. efficient. 
We shall essentially employ the notation of Chapter VIII. Speci Our problem is to find 


is legitimate and if its £, V combination is 


x, (a) the set of all efficient £, V combinations, and 
7 (b) a legitimate portfolio for each efficient E, V combination, 


is a portfolio. 
¥ Tue Critica, Line MerHop 
- nein As has been asserted before and will be shown below, the “critical line” 
legitimate if it satisfies constraints mh ‘ i 
aA por folic frelliratc eer, computing procedure of Chapter VIII traces out a continuous piecewise 
5 4 ws linear set in the space of portfolios. Each portfolio in this set is efficient; 
29, 


every efficient E, V combination is represented by some portfolio in this 
where 4 is an m by 2 matrix and 6 is an m comporent column set. After the set in the portfolio space is traced, the set of efficient E, V 


1, is the expected return on the jth security; 0,, is the covariance combinations can be readily ascertained. ' : 

the jth and kth securities: The linear segments in the portfolio space are computed in a series of 
mn Merations. On the sth iteration, that is, along the sth segment, certain 
* variables are referred to as in; the rest are referred to as ovr, We shall let 

F(t) = hy > is) 

be the indices of the set of variables which are in. 

The critical lines, segments of which make up the aforementioned piece- 


wise linear set of efficient portfolios, are defined in terms of the following 
vectors and matrices: 


‘ das CHAS 
Ri F M= . 
oy° a) 


The covariance matrix C is symmetric and positive semi-definite, Where C and 4 are as defined above, and 0 is an m by m zero matrix, 
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0 The computing procedure of Chapter VIII traces the efficient set in tle 

Rise f ( direction of decreasing 4y. The rth segment in the Apes 
b. 


consists of points on the sth critical line from Ay Af", prevlautly 
determined, to Ap = 2. AQ) is the largest value of Ay which sitishes 


@) MY NO, and 
(b) either 


where b is as defined above, and 0 is an n component column vector 


zeros. 
fe 

s="} 
0 


where is as defined above, and 0 is an m component column vector 
05. 
= variables are in on the rthiteration. Myiy isan (im + J,) by (m+ 
matrix obtained from M by deleting rows and columns of our vara 
the jth column and row are deleted from M to form M y,,) if and oy 
J4¢ Fl). Sgin is an (m + J,) component vector obtained He 
deleting the y, if  ¢ (0), that is, by deleting the p, of out variables, 


0 
Ryn = b 


is an (m + J,) component vector, where 0 is a J, component zero 
X 444, is the column vector of in variables. ¥ 5... is the column vee! 
out variables. 4 is an m component vector of Lagrangian multi 
Jp is an (m + 1)st Lagrangian. : § 
“The formula for the rth segment of the set of efficient portfolios i 


(i) X,,=0 for some je £(t), or 
Gi) 7), =0 for some jy ¢ $(1), oF 
Gi) Ap = 0. 
In case (bi), jo “goes out”: the in variables on the (f+ Ist iteration are 
all those which were in on the (th iteration, with the exception of jy. In 
case (bi), Jo “goes in": the in variables on the (/ + 1)st iteration are all 
those which were in on the /th iteration, Plus jy. In case (bili), the com- 
putation terminates at ly = 0. 
For the first iteration, 


HY 2 i =o. 


Te NRLQ Parer 


The critical line procedure outlined above was presented in [11], Our 
Present discussion will draw ona number of results in that paper. In {I], 
restrictions were placed on the matrices A and C which are not, in fact, 
‘required by the method. The purpose of the present discussion is to show 


x that the method still produces the desired results when these restrictions 
on 
(Ia) M yn [ = Ryn + PeS gun are removed. 

i The exposition of the critical line method tends to become cluttered by 
(ib) Xyay =0 (edious special cases. These “special cases” lead to little, if any, change in 


computing procedure, but they must be covered to ensure that the method 
works in all circumstances." 


‘The treatment of a number of special cases in [11] did not depend on the 


i i to this assul 
Or, assuming that M,,,, is non-singular (we return to | 
later), and letting Mi = N(1), (1a) and (1b) may be written 


Xs6 (estrictive assumptions about A and C, Those special cases will not be 
(2a) [ sar NOR OF AXNOS seo Meated here. Specifically the reader is referred to {11} for the following 
eases. 
ay) 2 0. 
(2b) Kyau 1. The linear programming problem 
We define 


y= Lake + Lads — Anes maximize E = p'X 
£ n 


subject to AX = b, 
x20 
X= 0 for jd £0), ' Both the present discussion and that in [11] ignore the round-off error problem, 
0 forje £00, Fsperience with the simplex method of linear programming suggests that double. 
me 2 


¥écision floating point arithmetic should be adequate for problems involving 200 (and 
Latest ny fethaps more) securities, 


(a) and (1b) may be written 
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ite solution; either because In the section below on “The Efficiency of the Generated Portfolios" we 
has nol ftersgiationyad show that, if the method “works,” then it traces the desired set of efficient 
(a) there is no X with AX = b, ¥ > 0, or portfolios. In the three succeeding sections after that one, we show that 
(b) obtainable £ is unbounded. the critical line method does “work.” 


2. The linear programming problem in 1 


(a) does not have a unique solution, or THE KuHn, Tucker Paper 


(b) its unique solution is degenerate. The Kuhn, Tucker results on non-linear programming (19] are ex- 


tremely valuable in discussing the critical line method. We will not 


review their general theorems but will state a corollary with immediate 
applicability to our problem. 
Suppose that we wish to choose ¥ so as to minimize 


S=BX'CK +X 


3, The computation to determine A, 
(%, and/or 7,) become zero simultaneously. In short, our oa 
cussion will assume that the linear programming problem to maximize 
has a unique, non-degenerate solution, and that a unique ‘X00 n; 


reveals that two or more varial 


zero to determine 1}, 


[11] discusses the construction of the first critical line, as deseri subject to 
Chapter VIII, step 2, page 177. [11] shows that on the first critical 


under the non-degeneracy assumptions listed above, not only do we 


X,=0 forj¢ (1), and 
9, =0 for je S(1), 


AX =6, 
x50, 


where y is an n component column vector whose typical element is y, and 
where C, X, A, and 6 are as defined above. Let 


HX, dM) = f+ HAX, 


where A is an m component vector of Lagrangian multipliers whose ith 
clement is 4. We define 


O6(X,A) 
y= Sy Leake t +E ey 
oe : 


According to [19], X minimizes f subject to AX = b and X >0 if and 
only if 


(a) AX = 6, X >0, and 
(b) there exists a vector A such that 


but also, for sufficiently large 4p, we have 


X,>0 forjeJ( 
ny>0 for j¢ (1). 


Also 
M yay is non-singular. 
We shall say that the critical line method “works” (in the sense 
prescriptions can be carried out) if 
ene " 
(1) at cach iteration, after the new j “goes in’ or “goes out, 
My, is non-singular; ante 9 
é the SOHC at Ap = Aj, on the rth critical line is the same ay 
i i ob 
the (1 + H)st critical line at Ae = Ap; 
(3) n,>0 forj¢ F(t), and X,>0 for je J(0) 
when 


(i) 1,20 forj=1,-- +n, and 
Gi)’ 94K, =0 forj=t,-- +m. 


Condition (bii) implies that if n, > then X, = 0, and conversely if 
X,> 0 then n, = 0. 

There are no requirements on the signs of the 3, since the Lagrangians 
Me associated with equalities rather than inequalities. 

The Kuhn, Tucker results presented above are closely related to the 
older applications of Lagrangian multipliers. We shall use the fact that 
# necessary and sufficient condition for X to minimize 


SaBX'CK + y'X 


7-9 > ag> WY 
on the sth critical line,' and : 
(4) AZ = 0 is reached for some finite 7. 


1 Inthe degenerate case, 2 must be replaced by 2”: 
120 for j¥ Fy 
X20 for je FW). 
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subject only to 
AX=b 
is that 
(a) AX =, and 
(b) there exists A such that 
Of + VAX 
n= Tr Tar oo for all j. 


THe EFFICieNcy OF THE GENERATED PORTFOLIOS 


This section shows that, if the critical line method works (as define 
the section on the NRLQ paper above), then 


(a) all portfolios generated by the computation are efficient, and 
(b) if (E®, V9) is an efficient E, V combination, then there is a port! 
X° generated by the computation which has 


Eo = px, 
Vi XK 


The critical line procedure (if it works) associates a portfolio Lo 
value of 4g > 0; that is, the procedure traces a single-valued fu 
Rag). We shall first show that 


Lemma: If Ag > Oand 


xt = X04), 
then X* minimizes 
f=V-IE 
subject to 
AX = b, 
X>0. 


According to the Kuhn, Tucker results, a necessary and sufficient co! 
for X* to minimize fis that 


(a) AX = b, X > 0, and 
(b) there exist 4 such that 


Gi) 20 
(i) Xnj=0 forja leon 


for j=l: +m 


where 2 
n= z On Xy — Hide + ¥ ake 


a 


PORTFOLIO SELECTION 
These conditions are met by X* = 2(Ap) since 
AX = b by construction; 
14 2 0, Xj 2 0 by definition of “to work"; and Xm, = 0, sie 
X,=0 forjyf, and 
m= 0 forje SF. 
Tueorem 1: If X* = 2(A$) for 2$ > 0, then X* is efficient, 
This follows from the lemma: 


Suppose X* minimizes f= V — J;E buts not efficient. ‘Then there is 
a portfolio ¥ such that either 


YR)<VxX*) and EUR) > E(x") 


VR)V< VX) and ER) > EX). 
In either case (RP) <f{X*). Contradiction, 
Tueoren 2: If X* = £(0), X° is efficient. 


The Kuhn, Tucker results directly imply that, when Ay = 0, R(x) 
minimizes V subject only to 


—_ 


—— 


or else 


AX = b, 
X20. 


This does not prove that X* = X(0) is efficient since there might be another 
obtainable portfolio ¥ which has the same (minimum) V but has higher E 
than does X*. Suppose that such an £ existed. There exist obtainable 
Portfolios in all neighborhoods of X* and 2, since all portfolios satisfying 


X=OX*+(1-AX, OSd<1 


are obtainable. Since E and V are continuous, given any Ay > 0, there 
exist neighborhoods, T* and 7, respectively, of ¥* and ¥ such that, if 


yer? 
g «fT, 


and 


then 
fy") = AVY") — AgEY*) > VG) — AgEG) = fiG). 
But this is impossible since X* is the limit of portfolios which minimize Sf 
Corottary: If X* = 2(Ag) for 0 <A < 0, X* is efficient. | 


eee (E*, V°) is an efficient E, V combination, then there exists 
7 


Xm FAS), 
Et = p' x8, 
VP mn XP CR 
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Under the non-degeneracy assumptions of the present discussion (page 31 
for Ay sufficiently large, (Ap) = X, where X is the portfolio with 
mum obtainable E. When Ay =0, ¥(2,) is a portfolio with minim\ 
variance. Let £ be the maximum obtainable E; let £ be the E of 
efficient portfolio with minimum Y. No legitimate portfolio has E> 
no efficient portfolio has E< £. Since X(ip) and E(X) are continuo 
if E<Ey<E there is 4% >0 such that X°= X(Ap) has £°= w 
Since ¥° is efficient and since there is only one efficient (E, V) combinat 
with E = £°, the theorem follows. 


Tue Now-sincutarity oF My 


We shall continue to assume the non-degeneracy conditions outlined § 
page 312, Frequently statements in this and the following section m 
qualified by @ phrase such as “under the aforementioned non-dege! 
conditions.” We state this qualification here, once and for all, 
than repeat it at the beginning of each paragraph. The reader is 
again referred to [11] for the treatment of degenerate cases. 

The critical line method is said “‘to work” if 


(1) at each iteration My is non-singular; 

(2) the portfolio at 4 on the ith critical line is the same as that 
onthe (1 + !)st critical line; 

(3) nj > Oforj 4 F(t), X, > Oforj « $s), when Ag) > Ag > 

(4) AZ = O is reached for some finite T. 


Conditions (1) and (3) are satisfied on the first critical line. We shall 
that if they are satisfied on the /th critical line then they are satisfied 
(t + I)st critical line, and condition (2) is satisfied between the 
the (¢ + I)st critical line. 

In the present section we show that M ,, remains non-singular, 

First consider the case in which a variable X, “goes in" for 
(t+ 1), Without loss of generality we may assume that 
X,- «+, X_ were in at iteration f, and that Xcqy 18 to “go in” for 

Let 
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d= 


= Operas 


Along the th critical line 


[; AVX 0 a 

# olla] [oft (ih: 

At hp = an Tex becomes zero—n,,, goes from positive to zero. Thus 
at Ag = i there is a unique solution to 

Legere yriral iy ¢ 0 

eal Agila = 01 

40 0 jl-dp. 6 

If M scusay is non-singular, the formula for the (t + 1)st critical line will be 
Ly oeaey. CaN oe 0 

coo a'|l Xu | = [0]. 

Aad a b 
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But, since V(Y. 9) = 0, we must have 
wail Ye) 


otherwise V(X + 6 Y, X,,1 + By) would be negative for suitably chosen 8. 
Thus 


By hypothesis 
'y hypot an 
My = Pont is non-singular. 


Since 74, becomes zero on the /th critical line, 
WR + Y, Nea ty) = WR, Meade 


CA ft If 
a figs, | must be non-singular. AR + akin =, 
aw #€ then 


AR+ Y) + aXeq $y) = 5. 


We shall show that The classical Lagrangian results imply that the portfolio (X®) on the :th 


Cee Ag critical line at Ay (where 7,,, becomes zero) minimizes AV AE among 
Myuay=|e¢ o 2 | is non-singular, X which satisfy 
yoereeaty (1) AR + aX,1 = b; and 


Q) X;=Oforjak+2,-- A, 
If Myivsay Were singular, there would exist (Y, y, 0) # (0] such that 
Coc ALY 0 


c a a’ il y|=/0), 


(No restrictions are placed on the signs of X, for j=1,-- +k +1) 
This implies that 


(rs A) + peay = 0. 


Otherwise 


Aa olla} lo y y 
xo + or X°— 
where Y has k components, 6 has components, and y is a scalar. : : 


implies 
uyaeaS in| i =, would have higher £ and the same V (hence X° could not minimize 
y 4V — Age over the aforementioned set). But 
(cs a fe 
=- (4). =0 
E IL--( «nal 
Hence ely implies shiva 
WY,y) = wal 
¢ o}ly. EY. ST] © 8 prs! = 10, 0, OF 
a" AOL 0: 
= -1vaif } ; : q | 
i This contradicts the assumption that 
=0. Cd p 
For any (%, Xpa1). ; : Hees 
tod Y) CT ga" is non-singular. 
VOR. Nay +9) = VOR Kear) + 208 ead if “\(3) + ane i de 
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Next consider the case in which a variable that was in at the sth iteration: 
“goes out” for the (f+ I)st. Without loss of generality we shall assume 
that the variables ¥,,-- +, X¢q, were in at iteration f, and that Yi 
“goes out” for iteration (1 + 1). C,c. A, a, ft, ¥, and o remain as define 
above. 

On the rth critical line 


Cc AUR 0 a 
coo oN Xe] = [0] + | oer | Ae 
Aa olla 6 0 


By hypothesis the above matrix, Myx), is non-singular. At the point 
the 1th critical line at which ,., becomes zero we have 


Cace Arak, 0 


© 


a 


a peer |] Xena _ [0 
Oy SOMA > 
o 1 0 ojl-aJ Lo. 


Since, by hypothesis, X,, did become zero, the matrix of these equal 


must also be non-singular. 
c | 
Myun =| 2 
su =| 7 9 


We must show that 
is non-singular. The proof is similar to that of the previous case; 
If Myer) Were singular, there would exist (Y; 6°) such that 


[alld 
=0. 
4 odie 
Cy=—-A'4, 
AY =0, 
WY) = Y'CY 
=— Yd 
=O. 


Hence 


From this it follows that V(X + Y) = V(X) for any X. 
If AX = 6, then 
A(X + ¥) =Ax +056. 
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ge 
The portfolio [ ou on the mth critical line at Ay = dy (where X,4, became 
zero) minimizes 


=4UV—-AUE 
subject to a z 


AK = b, 
Xen 0, ue = 06 X= 0. 
(No restrictions are placed on the signs of X;,-~ -, Xp) Hence 
WY =0. 


Otherwise either 
X94 Yor 
Re-y 


would give lower f subject to the constraint AX = b. 
But, since a’ Y = 0, we have 


Cc df 
1Y'0,8, Y'all © 7 My 
ally Nats 0, 0, 0, o}. 
OTe 1) 


Contradiction. 
Thus, whether ¥,,; “goes in” or “goes out,” M yi,41) is non-singular. 


THE NON-NEGATIVITY OF X AND 7) 


Let us return to the case in which X,(j = 1,- + -,k + I)are inalong the 
‘th critical line but X,,. goes to zero at 2. Both the equations which 
define the rth critical line and those which define the (t + I)st include: 

X,=0 forj>k+2 and 
n=O forj<k. 
The defining equations of the two critical lines differ only in that 
Tey = 0 along the sth critical line and 
X41 = 0 along the (7 + 1)st critical line. 
The vector (X, A, Xe) at Ap on the tth critical line satisfies both 7... = 0 
and X,,, = 0. Hence the portfolio at 4 on the tth critical line is the 


same as that at Ji on the (t + 1)st critical line. A similar ar; 
; ‘gument 
applies when X,, “goes in” at AS. In this case ji 


%14, = 0 along the rth critical line and 
Ter = 0 along the (t + 1)st critical line. 
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Again the vector (X, A, 4.) at A on the rth critical tine satisfies bout 
hence the portfolio at Ai, on the rth critical line is the same as that at ay. 
on the (t + 1)st critical line. ‘ 
We now show that, if M/y,,) is non-singular, and if 
X,>0 forje $(0), 
n>O forj¢ F(0, 
when Ab“) < Ap < dp on the sth critical line, then 
X,>0 forje $+), 
ny >0 forj¢ Fl +1). 
for ify > Ag > Aft” on the (r + I)st critical line. We have already 
that M giz is non-singular under the above assumptions. We shall fi 
consider the case in which a variable ‘goes in.” As before, we sI 
assume that variables X,,- > -, ¥, are in on the ‘th critical line and 
“goes in" for the (1 + 1)st. ‘ 
‘On the (t + 1)st critical line, at 4g = 7, we have 
X,>0 forj= ty: +k 
Xin = 9 
as well as 
x, 
We must show that, as Ag is lowered, X44, increases. 
Suppose that on the (¢ + 1)st critical line 
AX 
dy 
Then all points on this critical line satisfy 
(1) Xess = 05 
(2) X; = Oforj >k + 2; also 
Q) 9; = Oforj = 1,- > +, k; as well as 
(4) ter = 0. 2 
jis is i ible si ditions whic 
But this is impossible since (1), (2), and (3) are the con 1s whic! 
the rth critical line. By hypothesis, (4) holds on the th critical line 
at Ap = Ay. Thus either 
AX es 


=0 forjakt2,-+-" 


0. 


dXyy 
— <0. 
ag ay 


‘Only in the latter case does the method “work.” Suppose AX iy s[dAg 
then there is 1, such that 


>0 oF 


AEM > hp > Me 
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and such that on the (¢ + I)st critical line at Zp we have 
X,>0 forje $0 +1), 
X,=0 forj¢ f(t +1), 
mg >0 forjy $i +1), 
1 =0 forje $0 +1). 
Since M 4,41) is non-singular, the portfolio at A on the (¢ + L)st critical 
line uniquely minimizes 
WV —AgE 
subject to 
AX =, 
X,=0 forj>k+2 


(no restrictions on the signs of the X,). But the point on the ¢th critical 
line at Ap = Ly gives minimum obtainable %V — igE. Contradiction. 


A similar argument shows that, when Y,,, “goes out,” drpyilddp <0 
on the (f + 1)st critical line. 


FINITENESS 


Iona particular critical line a variable (X, or n,)is positive for Ag > At 
and zero for iy = Ay, then it must be negative for Ag <A. It follows 
that no critical line can occur more than once in tracing X(Ag). Since 
there are only a finite number of possible critical lines, 4, = 0 is reached 
in a finite number of iterations, 


E, S, EFFIcient Sets 


The computing procedures for obtaining £, S, efficient portfolios, 
described in Chapter IX, are almost the same as those for the E, V 
efficient set problem, In the £, S, case, as in the E, V case, an efficient 
portfolio £(A,) is associated with each non-negative value of Ay. As 
before, £(A,:) is piecewise linear. 

The procedures differ in the following respect: A} is the largest value of 
Ap which satisfies 

(1) AY > A, and 

(2) either 

(i) X,=0 forsomeje ¥(t), or 
Gi); =0 forsome jy f(t), or 
(iii) Ap=0, or 

(iv) px =0 
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where p, is an n component column vector whose jth component is 
p,, = (the return of the jth security in year 7) — (6). 


In cases (2i), (2ii), and (2iii) the procedure is the same as in the £, V com- 


putation. In case (2iv) the matrix 
M. [ : 
only 
becomes 
Cut 
My = Fr 


where either 


Cin +h 
Con = Ce 


4 iti ive for Ap > Ab. (By 
depending on whether p/X was positive or negative for Ag 
ae car on &, where ¥, C,etc., remain as defined above. In the be, 
discussion we may write p instead of A, without danger of confusior i r 
We shall make essentially the same non-degeneracy assumptions 
present discussions as were made in the E, Vcase. In particular we 
assume 


or else 


(a) that the linear programming problem, to maximize E, has & 

on-degenerate solution, and 5 ‘ 
: (b) ha there is a unique variable (X;, n» oF p;X) which determi 
fort <T. 


We shall say that the procedure for generating £, S, efficient 
“works” if 


j ‘ Bee 
(1) M gc is non-singular for t= 1,- > *, 
(2) the point at Ay = dy on the sth critical 

Ag = Xe on the (t + l)st critical line; 

“Ga) x, > 0 for je $0), 4 > 0 for jy F(t), for 

the tth critical line; 

(3b) ifp'X = Oat Ap = Ae, 


line is the same as 


o> Ag 


pX <0 for A“) > Ay > dp 

implies X>0 for Pies age 
1 

and conversely Pee 
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implies 


p'X <0 for de> ag > aD; 
and 


(4) AE = Os reached for finite T. 


‘We shall first show that, ifthe Procedure “works,” it produces the desired 
efficient sets. We then show that the procedure does work. 


Tue E, S, EFriciency oF THE Ponts GENERATED. 


SX) is the expected value of a function g(rX), where g is convex and r 


israndom. This implies that 5,(X) is itself a convex function of Y. The 
convexity of S(X) implies the convexity of 


L= SX) Age. 


The convexity of f implies that, if a portfolio provides a local minimum, it 

also provides a global minimum for f. But the Kuhn, Tucker conditions 

assure us that ¥(4,) provides a local, and hence a global, minimum to 
YS(X) — Ape. 


The minimization of % SX) — AyE implies efficiency; and the argu- 
ment proceeds from here as in the E, V discussion. 


THAT THE E, S, Proceoure “Works” 


Incase an X, or n, goes to zero, the procedure is the same as in the EV 
efficient set computation, We have already seen that the procedure 
“works” in this case. 

In this section we show that the E, ‘Sy procedure “works” when a p/X 
goes to zero. Parts of the proof which are very similar to corresponding 
Parts of the proof in the E, V case will not be spelled out in detail. 


By hypothesis, 
i GoAqy 
ORV y is non-singular, 


Since p’X became zero, 


Ps) 
oot 
oot 


= 
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must also be non-singular. We shall show that the matrix 


i +R 
a 


Finally we must show that, if p'X = 0 at 2 on the sth critical line, then 


an ee 3 eX <0 forii-Y> a, > dt 

| is non-singular. ai ‘3 es 
PX>0 for de> dp > ie 

and conversely | 


p'X>0 ford > de> A 


Essentially the same argument will apply to 


erage 


C+ pe ‘| 
[a 


implies 
PX<O for dy > dp > MAY, 
This follows as a corollary of the following, 


Lemma: If ¥2D uniquely minimizes 


were singular, there would exist pate y ig that age ace “he 
Cath é AR = b, 
[ a Sllely and X minimizes 
‘i ak WATCH + ¥'pp'X) — iE 
subject 10 
In the same manner as in the E, V case, we can show that ae 


¥(C + pp)¥ =0, then g'¥ 1 > Oimplies p'X2 > 0. (The generalization toM(L'CX — X'pp' F) 


d, from this, that —ZpE, or p'¥P <0, is immediate.) 
and, from this, 


a 1 aUd Suppose that j'¥2 > 0 but p’'X2 <0, Consider the straight line which 
But this implies that Cia passes through ¥ and ¥*. Each point on this line is characterized by 
f' i its value of y= 6'X. Along this line f= % &’CX — A, is a quad- 

(Yo, YA]A 0 0] = (00,0). ratic which may be written | 

x0 0 S=at Koyo, | 


Contradiction. A 

If p'X = Oat dy = Ai, on the rth critical line, then the portfolio 

‘on the rth critical line is the same as that at Ay = Aig on the (¢ 
critical line, since in the former case we have 


[ eIE)- [th 


while in the latter case we have 


CleP del 


where b> Oandc =p ¥2 > 0, 
We also have 


Ppp k = (Xp) = y' 
Thus 


St HR pp 
is of the form 


a+ BY — cP + hy’, 
which has its minimum at 


contradicting the assumption that 


G= RR? <0. 
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ve lemma implies that, if p’X would have become negative 

torte <a along the aid critical line defined in terms of (es then e a 

become negative along the new critical line defined in terms of ite 

Ag = 0 is reached in a finite number of iterations since any on Ma, 
of in variables and years can give rise to at most one segment o} 
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and there are a finite number of such combinations. 


A SIMPLEX METHOD FOR THE PORTFOLIO 
SELECTION PROBLEM 


Philip Wolfe [13] has shown that a slight variant of the simplex method. 
can be used to solve the problem 
minimize f(z) = 2'Cr, 
z>0, 
Ax = b, 
where 2 is an m by 1 matrix, 2’ is its transpose, C is an n by n positive 
semi-definite matrix, and A is an m by n matrix. 

To solve the above problem by Wolfe's technique the simplex procedure 
must be modified so that certain pairs of variables are not allowed in the 
basis simultaneously. 

In the portfolio selection problem f(z) is minimized subject to the 
constraints 

Az = b, 
220, 
and 
HEE, 
where yw is a 1 by m matrix and E isa scalar. E is not fixed in value; 
rather the problem is to find min f for all possible values of E. 

This Appendix shows that the simplex method, with Wolfe’s amendment, 
can be used to solve the whole portfolio selection problem. Since one by- 
Product of the portfolio selection computation is the point x which mini- 
mizes f(z) subject only to Ax = b and x > 0, the Procedure presented in this 
Appendix also provides an altemative method of sclving this problem. 

We define 
— WU/2Yfla) + Adz — Ayu 

1 
= Jean + Thay + Asay 
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where cy is the jkth element of C, a,, is the jth clement of Ay ly is the jth 

element of 4; and Aisa 1 by m matrix of Lagrangian multipliers whose ith 
nent is Ay. 

rhe eritieal line method of quadratic programming uses the fact that 

there is a piecewise linear set of points (z) which give minimum fia) for 

each value of E. Associated with any linear segment of this set isa set of 

variables J = (jy.°* ‘sjy). Along the linear segment we have 


on for all j, 
2,20. 

ya forjef, 
x,=0 forjyS. 


‘We also have 
Fs 
A =r 0] fe 
<1 f}-0-2 
Ao 0 5 0. 
4, 
In the critical line procedure Ay is reduced until some 1, 4 F oF 2 SJ 
reaches zero. Ifan n,, reaches zero first, jo “goes into the set ¥, so that 
along the next critical line 
Tat) 


as well as 
1, =0 for jin the old ¥. 


Conversely, if an 2, reaches zero first, jg “goes out” of ¥ for the 
critical line. ‘The computing procedure continues until Ay = 0 is reac! 
At this point /(2) is minimized subject only to Ax = 6,2 = 0. 
Consider the “amended” linear programming problem: 
minimize Ap 
Fa 


=! ‘“ 0 
= ds=| | 
0 ] ; (i rm (i 
1. 
where (I) 7) and 2, can never be in a basis simultaneously; (2) A 
(y+ « * Aq) is not constrained to be non-negative; and (3) ay 


are always in the basis. A basis to this problem contains (m + 
variables, As long as Ag > 0, the basis must consist of 


“ 
ibject to 
ee A 0 


Igy 
Ay + Any and 
n—1 of the z, and n;. 
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(We shall assume that there are no degenerate bases. This is a convenient 
‘but unessential assumption; see [11] concerning degeneracy in the critical 
Tine method.) By construction, if z, is in the basis, then », is not; and 
vice versa. Since there are n — | of the z, or 7, in the basis, there can be 
only one jy for which both its z,, and 7,, are notin the basis. According to 
the amended procedure, one of these must go into the basis. 

As will be discussed below, the introduction of one of these into the 
basis will increase Ag; the introduction of the other will decrease Ag. 
‘The variable (z,, or 1y,) which decreases Ag is introduced into the basis by 
increasing its value until some other variable (z, or 7, with j # jq) goes to 
zero. This again leaves us with one j with z,, and 7,, out of the basis, 
The procedure is repeated until Ay = Ois reached. 

Comparison of the amended simplex computation and the critical line 
Procedure shows that, if they are started out together, they will continue 
together.! x and 7 vary with A, along a critical line by exactly the same 
formula that relates them to Ag when the new variable is introduced into 
the basis. The sequence of z's of 1's which go to zero, and that of their 
counterparts which become non-zero, are exactly the same. 

The proof that the critical line method works, therefore, is a proof that 
the amended simplex method produces the desired results. It is this 
equivalence between the two procedures which implies that, when 2,, and 
n,, are both outside the simplex basis, one will increase A, while the other 
will decrease it. 

The practical implication of the above result is that any of the linear 
Programming codes for high speed, internally programmed computers can 
be conveniently converted to a quadratic programming code for solving, 
among other things, the portfolio selection problem. 


2 See [11] for starting the portfolio problem. 


—— 
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ALTERNATIVE AXIOM SYSTEMS 
FOR EXPECTED UTILITY 


In Chapter X we showed that the expected utility maxim is implied by 
the following three axioms: 

Axiom la, For any probability vectors P, Q, 

P>Q or Q>P (or both). 

Axiom Ib, IfP > QandQ@ > R, then P > R. 

By definition 
P~QmeansP2>Q and Q2P; 
P>QmeansP>Q and nw Qg>P; 
P<Qmeans Q>P. 

Axiom Il: If P > Q, then 
aP +(1—aR>aQ+(1-@R 
for all probability vectors R, and 
O<a<i. 

Axiom IIL: If P > Q > R, then there is an a such that 

aP+(1-aR~Q. 

We shall show that the expected utility maxim still follows if 
replaced by either III’ or III" below: 

‘Axiom HI’: IEP > Q > R, then 

(a) there is an a < 1 such that 2P + (I — @)R > Q, and 

(b) there is a B > 0 such that Q > BP + (1 — AR. 

Axiom III": Let ¢; be the ith unit vector: 

(a) there exist ¢;, ¢;, and ¢ such that 

rar ty and 
340 
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(b) for any triplet i, j, k such that 
there exists an « such that i 
e wae, + (I — ade. 
To show that I, II, and III’ imply the expected utility rule, 

' ry rule, we need onl 
show that they imply III. We shall therefore assume I, II, III’ aa 
P>Q> R; weshall show that there is an & such that 

aP+(1—aR~Q. 
Axiom II implies that, if } 2 
aP+(1—®)R>Q, 


then 

we BP + (1 — plaP + (1 — «)R| > Q; 
Bn, aP+(1—@R>Q, 
Pi a=f-1+(1—f)e for0<fsi. 


«P+(1—«)R>Q, 
then the same is true for all @ > a. Conversely, if 
aP +(1—a)R<Q, 
then the same is true for all % <a. This, together with Axio: ? 
implies that there is an @ such that at ai aad 
@o<a<1, 
Gi) @ is the greatest lower bound of « satisfying 
aP+(1—a@R> Q, 
(iii) @ is the least upper bound of « satisfying 
mi aP+(1—aR<9Q. 
@P+(1—@R>Q? or 
aP+(1—&R<Q? or 
aP+(1—8)R~Q? 
‘We shall show that the first two cannot hold; i u 
Mea old; hence Axioms I, II, and III’ 
&P + (1— 8)R> Q, 


there would exist 6 < 1 such that. 
AlaP + (1 — OR) +(1- AR> OQ: 


Wisi al a 
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that is The corresponding to P, (i = 1,- > +,m — 1) are linearly independent. 
paP + (1 — a)R> O. This may be seen by noting the location of non-zeros in the (m ~ 1) by 


(m — 1) matrix whose ith column is 


This contradicts the assertion that @ is the g.l.b. of the a which satisfy 


x ponaes 
aP + (1— a)R> Q. ; 
A similar contradiction follows from the assumption that a : 
RP +(1—-aR< QO. : 
2 
Hence I, II, and II’ imply IIL. a 
A different approach is used to show that I, II, and III" imply the 4 : 


expected utility rule. 
If P is an m component probability vector As illustrated by the x's above, the non-zeros appear along the diagonal 
and (perhaps) in the first row. (If this were a matrix of P, instead of B, 
there might also be non-zeros in the mth row.) The determinant of the 
matrix is the product of the diagonal elements, and thus is non-zero. The 
P,, therefore, determine a plane y in the P space. 
As the reader may confirm, Axioms I and II imply 


(a) if @ and R are in p, then Q ~ R; 


(b) if @ and R are in a plane y’ parallel to y, then Q ~ R, and 
(c) 1, Il, and Il” imply the expected utility rule. 


Py 


then we will let P be an (m — 1) component vector 
Pr 

We have discussed axiom systems which imply the expected utility rule. 
Below we show that, if we weaken the axioms in certain respects, we can- 
not derive the expected utility rule. None of the axiom systems discussed 
below is inconsistent with expected utility. Their “drawback” is that they 
are consistent with other rules as well, They are not strong enough to 
rule out certain preference systems which are ruled out by Axioms I, II, 
and IIT, MI’, or HI", 

Axiom III” would be a genuine weakening of III except for the inclusion 
of part (a). Part (a) is necessary, as may be shown by an example in 
which I, I, and part (b) of III" are satisfied, but in which the expected 
utility rule does not hold. 

Suppose that there are three outcomes, and that P = (Pi Po Pa) is 
preferred to Q = (gy, 43,43) if 


(a) Ps <q, or 
(b) Ps = 43 > Oand p, > gy. 


P~ Q if and only if 


with p,, omitted. It will be convenient to assume that 
et et oe 
Let the kth outcome be such that 
1 > te > Cw 


Part (a) of III” assures us that such a k exists, For i= 1,-- +m 
define a P, in one of three ways: 


if e, ~ e,, then P, = e, ; 

if e;>e, then P, = ae,+(1 —a)eq, where « is chosen 40 
Pi~e, 

Nie eg heath = az, + (1 — a)e,, where a is chosen so that P, =; 


We thus have (m — 1) points P, such that 


Powe 


Ps= qs = 0. 
The reader can confirm that the example has the desired attributes. 
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Part (b) of III’ is a weakening of III. One might conjecture that a 
similar weakening could be applied to III’, We shall show, however, that 
preferences inconsistent with the expected utility maxim can be consistent 
with Axioms I, Il, part (a) of III, and the following modified version of 
part (b) of III": 

MIl"b': Ife; > ¢; > en, then 

(i) there is an a # 1 such that 

ae, +(1—adee> ey and 
(ii) there is a B 4 O such that 
Be; + (1 — Boer < es 

Suppose that there are three outcomes such that € > €3 > ¢s- P= 

(Pa. Pas Ps) is preferred to O = (91, ga. 9s) if 


(1) pi— Pe> i — qv OF 
Q) pr — Pe=% — Je and 
A> he 

The reader can confirm that the preferences in the above example satisfy 
I, Il, part (a) of III’, and the modified part (b') of III’, but they do not 
satisfy the expected utility maxim 

In another axiomatic system [26] it was only necessary to postulate an 
axiom similar to II for the case a = 1/2, This suggests that perhaps the 
expected utility rule could still be derived if, in our present axiom structure, 
II were replaced by 


W: If P > Q, then 
(UDP + (/2)R > (1/2 + (1/2)R 


for any probability vector R. We shall show, however, that {, II’, and: 
either III, III’, or III” are consistent with preference patterns which are not 
consistent with expected ut Our examples will depend on Zorn’s 
Lemma or, equivalently, the Axiom of Choice. 

Suppose that there are two outcomes. A probability distribution, then 
is characterized by p = p,. In this case the expected utility rule is equivay 
lent to a requirement for a monotonic ordering of 0< p< 1. Either we 
have 


(a) pis preferred to g when it is numerically greater, 

(b) p is preferred to g when it is numerically less, or 

(©) p ~@ for all p, q. 

Our discussion will be based on the concept of a’Hamel Basis wl 
in turn, is based on Zorn’s Lemma. We shall state Zorn’s Lemma, shi 
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the existence of a particular kind of basis for the numbers 0 <p < I, and 
finally show that a non-monotonic ordering of 0< p< is consistent 
with Axioms J, II’, and 111 or TI’, (An example involving IIT” will be 
discussed subsequently.) 
A set P is “partially ordered” with respect to a relation > 
(1) ifa Sa forallae P; 
(2) for a,b « P, 
if a>b and boa, 
then a =}, and 
(3) for a,b, ce P, 
if a>b and b>c, 
then adc. 


A subset Sc Pis linearly ordered ifa¢SandbeS implya>borb<a. 
ze Pis an upperbound of Sif 


z>a forallaeS. 


(z need not be in S.) y«P is a maximal element if there does not exist 
aeP,a#y, such thata>y. 


Zorn's Lemma: If P is a partially ordered set such that every linearly ordered 
subset fae an ipperbeund, then Phas diseammatslanede 


Let R be any subset of the real numbers r: O<r<1. We shall say 
that the numbers in R are “independent” (with respect to the rationals) if 
there docs not exist a finite linear combination 


> a7, = 0, 
a 


with r, «R,n>1,andq,,* - +, a, non-zero rational numbers. (R may be 
infinite, but Zar is a finite sum involving a subset from R.) We shall say 
that R is a basis for the numbers 0 <7<1 


(a) if R is a set of independent numbers satisfying 0 <r<1, and 
(b) if0 <y <1, then 


y= Day, 


im 


for suitably chosen rational a, # 0 and 7, ¢ R. 


Ina moment it will be shown that Zorn's Lemma implies that such a basis 
exists, We now note that, if R is such a basis, the a,r; in 


y= Lar, 
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are unique. Otherwise we would have 


Thus 
Yar, — 247, = 0, 


contradicting the independence assumption. 

Let # be the set of all independent sets (S) of pointsind<r<l, We 
shall define S, > 5,to mean S, > S, # is partially ordered with respect 
to >. Suppose J isa linearly ordered subset of 9; thatis, if S, <I and 
SyeF, then 


SS, “or SS, 


The union of all the Se.%, US, is in-?: for otherwise there would exist 
a finite sum, 
Zar, = 0, 

with the a, non-zero rationals and r,« US. But this would also hold true 
for some SeF. Thus every linearly ordered subset of ? has an upper 
bound, Hence ¥ has a maximal element (R). But R is a basis for 
O<r<l since 

(a) Re # implies that R is a (rationally) independent set; and 

(b) if y¢R but O<y<l, then there exist non-zero rationals 
ay,- «+54, andr, €R such that 


ya Late 


otherwise the union of {y} and R would be (rationally) independe 
contradicting the assertion that R is a maximal element of F. 


We define a non-monotonic ordering function on 0 < p< | as follows: 


Let U(0) = 0. 
Forr ¢ R, U(r) = any rational, provided only that U(r) is not monot 


Fory¢R, 


uy) = 3 aud, 
where 
eR 
and 
asa 


Consider the ordering, p is preferred to q if and only if 
Up) > UUq)- 
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This ordering is not consistent with the expected utility rule, since it is not 
monotonic. It is, however, consistent with Axioms I, II’, III, and III’; 


Axiom 1 is satisfied trivially. 
Axiom II’ is satisfied since 
Ul(1P2)p + (/2)r) = (1/2)U) + (1/2) U(r) 
for all O<p,r< 1. 


That Axiom III is satisfied may be seen as follows: Suj i 
+ Suppose that p is 
preferred to q and q is preferred tor. U(p), U(g), and U(r) are raat 


such that 
Up) > Ug) > U(r). 
« “Toon is a rational. 


Ulap + (1 — a)r] = aU(p) + (1 — a)U(r) = U(Q). 


The reader can readily verify that the ordering is also consistent with III’. 
Since ere example involved ‘only two possible outcomes, it could 
not satisfy III", Consider, however, the followi Let 
ot sat lowing example: Let U(p) be 


U(1) > 0. 
Suppose that there are three outcomes whose probability vectors P = 
(P1» Pas Ps) ate preferred according to the following ordering function: 


(Py, Px) = U(r) — Ups). 


This ordering is clearly inconsistent with the 43 
theless: v Bi expected utility rule, never- 


IIT’ is satisfied since 


(a) e, > ey > ep and 
(b) ¢5 ~ (1/2)e, + (1/2)es. 


1 is satisfied trivially; and 
11's satisfied since we still have 


VUU2YP + (1/2)Q] = (1/2)H(P) + (1/2)H(Q). 
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